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HEPATITIS-C VIRUS TESTING 

Technical Field 

The present invention relates to the discovery of new 
types of hepatitis C virus, that we have termed type 3 
(HCV-3) and type 4 (HCV-4) . In particular, it relates to 
the etiologic agent of hepatitis C virus type 3 and 4, and 
to polynucleotides and immunoreactive polypeptides which 
are useful in immunoassays for the detection of HCV-3 and 
HCV-4 in biological samples; and also to the use of 
antigenic HCV-3 and HCV-4 specific polypeptides in 
vaccines . 

Background of the Invention 

Acute viral hepatitis is a disease which may result 
in chronic liver damage. It is clinically diagnosed by a 
well-defined set of patient symptoms, including jaundice, 
hepatic tenderness, and an increase in the serum levels of 
alanine aminotransferase and aspartate aminotransferase. 
Serologic immunoassays are generally performed to diagnose 
the specific type of viral causative agent. 
Historically, patients presenting with symptoms of 
/ hepatitis and not otherwise infected by hepatitis A, 
hepatitis B, Epstein-Barr or cytomegalovirus were 
clinically diagnosed as having non-A, non-B hepatitis 
(NANBH) by default. 



WO 93/10239 



PCT/GB92/02143 



-2 - 

For many years, the agent of non-A, non-B hepatitis 
remained elusive. It has now been established that many 
cases of NANBH are caused by a distinct virus termed 
hepatitis C virus (HCV) . European Patent Application 
EP-A-03 18216 discloses cDHA sequences derived from HCV, 
polynucleotide probes and polypeptides for use in 
immunoassays. Further information is provided in 
European Application EP-A-0388232. 

The HCV genome encodes a large polyprotein precursor, 
which contains structural and non-structural regions. The 
single protein is apparently cleaved into a variety of 
proteins after production. Most of the structural and 
non-structural proteins have now been identified from in 
vitro UNA translation and expression as recombinant 
proteins. The C and E regions encode for nucleocapsid 
structural proteins and for envelope structural proteins, 
respectively. At least five additional regions follow, 
which encode for non-structural (NS) protein of undefined 
function. The organisation is believed to be as follows 
(A. Alberti, Journal of Hepatology, 1991; 12; 279 to 282) 

5' 3 ' 
NCR: C : El : E2 : NS1 : NS2 : NS3 : NS4 : NS5 

Certain immunoreactive proteins have been described as 

recombinant proteins, for example C22 (in the core 

region) , C33 (in NS3 region) , 5-1-1 and C100 (both in the 

NS4 region) , and NS5 (NS5 region) . Diagnosis of 

hepatitis C is still largely based on methods which detect 

antibodies against the product of the C-100 clone. This 
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clone was ligated with overlapping clones to produce a 
larger viral antigen (C100) corresponding to part of the 
NS3-NS4 genomic region. C100 was then fused with the 
human superoxide dismutase (SOD) gene, expressed in use as 
a large recombinant fusion protein (C100-3) and used on 
solid phase to develop radio-labelled (RIA) and 
enzyme-linked immunosorbent assays (ELISA) . 

Polynucleotides useful for screening for HCV are 
disclosed in European Patent Specification EP-A-0398748 . 
European Patent Specification EP-A-0414475 purports to 
disclose the propagation of HCV in culture cells and the 
production of antigens for use in diagnostics. European 
Patent Specification EP-A-0445423 discloses an improved 
immunoassay for detecting HCV antibodies. 

Blood banks in the United Kingdom have recently begun 
routine testing of blood donors for antibodies to 
components of HCV. One assay involves the detection of 
HCV antibodies to C100-3 polypeptides. The C100-3 
antibody recognises a composite polyprotein antigen within 
non-structural regions of the virus and is a consistent 
marker of HCV infection. However, in acute infections 
this antibody is unreliable because of the delay 
(typically 22 weeks) in seroconversion after exposure. 
Furthermore, the CI 00-3 antibody test lacks specificity 
for the hepatitis C virus. 

Second generation antibody tests employ recombinant 
antigens or synthetic linear peptides representing 
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structural antigens from the highly conserved core region 
of the virus as well as non-structural antigens. However, 
it is found that some second-generation ELISA tests can 
yield false-positive reactions. The recombinant 
immunoblot assay (RIBA-2) incorporating four antigens from 
the HCV genome, provides a method for identifying 
genuine-anti-HCV reactivity. However, the result can be 
"indeterminate". The present workers have reported (The 
Lancet, 338; Oct. 19, 1991) varying reactivity of 
HCV-positive blood donors to 5-1-1, C100, C33C and C22 
antigens, and compared these with the results of the 
direct detection of HCV RNA present in the blood samples 
using polymerase chain reaction (PCR) to amplify HCV 
polynucleotides. However, the work demonstrates that the 
unambiguous diagnosis of HCV infections is not yet 
possible. 

Recently there has been discovered a second type of 
HCV (References 1, 2) called K2 that differs considerably 
in sequence from the published prototype (Reference 3) or 
the first type Kl sequences (References 4 and 5) . 

Summary of the Invention 

The present invention is based on the discovery of a 
previously unknown type 3 and 4 variants of HCV, by a 
comparison of sequences amplified by PCR in certain 
regions of the HCV genome and confirmed by phylogenetic 
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analysis. The invention has thus identified 
polynucleotide sequences and polypeptides which are HCV-3 
and HCV-4 specific. These may be used to diagnose HCV-3 
and HCV-4 infection and should thus be included in any 
definitive test for HCV infection. 

One aspect of the invention provides polynucleotide 
sequences unique to hepatitis C virus types 3 and 4 (HCV-3 
and HCV-4). The sequences may be RNA or DNA sequences. 
In principal any HCV-3 or HCV-4 specific polynucleotide 
sequence from non-coding , core, El, E2 or NS1-5 genome 
regions can be used as a hybridisation probe. The 
sequences may be recombinant (i.e. expressed in 
transformed cells) or synthetic and may be comprised 
within longer sequences if necessary. Equally, deletions, 
insertions or substitutions may also be tolerated if the 
polynucleotide may still function as a specific probe. 
Polynucleotide sequences such as core, NS3, NS4 and NS5 
which code for antigenic proteins are particularly useful. 

Another aspect provides an antigenic HCV-3 or HCV-4 
specific polypeptide, particularly from the core, NS3 , NS4 
or NS5 regions (e.g. the HCV-3 or HCV-4 counterparts of 
C100 polypeptide, 5-1-1 polypeptide, C33 polypeptide or 
C22 polypeptide or epitopes thereof) or polypeptides 
including these antigens. 

A further aspect of the invention provides labelled 
antigenic HCV-3 or HCV-4 specific polypeptide (or mixtures 
thereof, particularly from the core and NS4 regions) for 
use in an immunoassay. 
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A further aspect of the invention provides antibodies 
to HCV-3 or HCV-4 specific antigens, particularly 
monoclonal antibodies for use in therapy and diagnosis. 
Thus labelled antibodies may be used for in vivo 
diagnosis. Antibodies carrying cytotoxic agents may be 
used to attack HCV-3 or HCV-4 infected cells. 

A further aspect of the invention provides a vaccine 
comprising immunogenic HCV-3 or HCV-4 specific polypeptide. 

The HCV-3 or HCV-4 specific polynucleotide sequences 
may be used for identification of the HCV virus itself 
(usually amplified by PCR) by hybridisation techniques. 

Oligonucleotides corresponding to variable regions in 
the NS-4 region could be used for type-specific PGR. 
outer sense and inner sense primers may be used in 
combination with the two conserved anti-sense primers for 
a specific detection method for HCV types 1, 2, 3 and 4. 

Immunoreactive HCV-3 or HCV-4 specific polypeptides 
(particularly from the core and NS4 regions) may be used 
to detect HCV-3 and HCV-4 antibodies in biological 
samples , and may also provide the basis for immunogens for 
inclusion in vaccines (especially the El polypeptide) . 
The term "peptide" is used herein to include epitopic 
peptides having the minimum number of amino acid residues 
for antigenicity, through oligopeptides, up to proteins. 
The peptide may be a recombinant peptide expressed from a 
transformed cell, or could be a synthetic peptide produced 
by chemical synthesis. 



WO 93/10239 



PCI7GB92/02143 



- 7 - 

In particular, the invention allows blood donor 
screening by conventional assays (using HCV type 1 encoded 
antigens) to be supplemented with a second test that 
contains two oligopeptides corresponding to first and 
second antigenic regions found in the NS-4 sequence of HCV 
type 3 (positions 1691 to 1708; sequence 
KPALVPDKEVLYQQYDEM and positions 1710 to 1728; sequence 
E CS Q AAPY IEQAQ VIAHQF ) and two derived from the equivalent 
regions of HCV type 2, R(A/V) V(V/I) (A/T) PDKE(I/V) LYEAFDEM 
and EGAS (K/R) AALIEEGQR(M/I) AEML. 

The corresponding HCV-4 antigens from substantially 
positions 1691 to 1708 and 1710 to 1728 may be used for 
HCV-4 detection. 

Thus, the present invention has also identified 
corresponding polynucleotide and polypeptide sequences 
which may be used to identify hepatitis C type 2 viral 
infection. 

Production and detection of the antigen-antibody 
immune complex may be carried out by any methods currently 
known in the art. For example, a labelling system such 
as enzyme, radioisotope, fluorescent, luminescent or 
chemiluminescent labels may be employed, usually attached 
to the antigen. Labelled anti-antibody systems may also 
be used. The recombinant antigen may be either used in 
liquid phase or absorbed onto a solid substrate. 

Oligopeptides corresponding to the antigenic regions 
of all three major types may also be used separately to 
serologically distinguish individuals infected with 
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different HCV types. Such an assay could be in the format 
of an indirect enzyme immunoassay (EIA) that used sets of 
wells or beads coated with oligopeptides of the two major 
antigenic regions for HCV types 4, 3 and 2, and with type 
1 (KPA(V/I) IPDREVLYKEFDEM and RPAV ( I/V) PDREVLYQEFDEM and 
ECSQHLPYIEG (M/A) AEQF) . Minor degrees of cross-reactivity, 
should they exist, can be absorbed out by dilution of the 
test serum in a diluent that contained blocking amounts of 
soluble heterologous-type oligopeptides, to ensure that 
only antibody with type-specific antibody reactivity bound 
to the solid phase. 

Immunogens for use in vaccine formulations may be 
formulated according to techniques currently known in the 
art, including the use of suitable adjuvant and 
immune-stimulation systems. 

Finally, the present invention also encompasses assay 
kits including polypeptides which contain at least one 
epitope of HCV-3 or HCV-4 antigen (or antibodies thereto), 
as well as necessary preparative reagents, washing 
reagents, detection reagents and signal producing reagents. 

Description of the Drawings 

Examples of the invention will now be described by 
way of example only. 

Figures 1 and la give cDNA sequences obtained from 
PCR amplification of a region -255 to -62 of the 5' 
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non-coding region of HCV samples from 18 blood donors and 
a comparison with previously published nucleotide 
sequences (see Table 2); sequence numbering corresponding 
to the prototype HCV-1 sequence (ref.4) and previous 
designations of type 1 or 2 being indicated; 

Figure 2 is a phylogenetic analysis showing 
clustering of the sequences into three types viz; HCV-1, 
HCV-2 and HCV- 3 for the 5' NCR results of Figure 1 using 
the maximum likelihood algorithm, shown as an unrooted 
tree. Numbers 1-18 in full circles correspond to blood 
donor sequences E-bl through E-bl8. Numbers 1 to 26 in 
open circles correspond to the previously published 
sequences identified in Table 2.; 

Figure 3 is a comparison of deduced amino acid 
sequences in the NS-5 region of blood donors (E-bl, E-b2, 
E-b3, E-b7 (type 3) and E-bl2 (type 2) with those 
previously published (Table 2). Amino acid residue 
numbering follows that of the HCV-1 polyprotein (4) and 
uses single letter amino acid codes; 

Figure 4 is a phylogenetic analysis of the NS-5 
region using the maximum likelihood algorithm, shown as an 
unrooted tree. Symbols are as described for Figure 2; 

Figure 5 is a comparison of deduced amino acid 
sequences in the NS-3 region of blood donors (E-bl, E-b2, 
E-b6, E-b7 (type 3) with those previously published 
(Table 2). Group 1/1: amino acid sequence of fl, f3, f4, 
f5, h2, h3, h4 (one), i2, i3, i4, pi, p2 ; Group 1/2: 
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amino acid sequence of i5; Group 1/3: amino acid sequence 
of h2, h3, h4 (one), h5, f2, p3 f il; Group 1/4: amino 
acid sequence of hi (one); Group 1/5: amino acid sequence 
of hi (one) . Numbering, symbols and abbreviations are as 
described for Figure 3; 

Figure 6 is a phylogenetic analysis of the NS-3 
region using the maximum likelihood algorithm, shown as an 
unrooted tree. Representative nucleotide sequences of the 
5 groups of type 1 sequences shown in Figure 5 coded as 
follows: 19 (full circle) i3; 20 (full circle) i4; 21 
(full circle) h5; 22 (full circle) h3; 23 (full circle) 
hi. Symbols are as described for Figure 2; 

Figure 7 is a comparison of deduced amino acid 
sequences in the core region of blood donor E-bl (type 3) 
with those previously published (Table 2) . Numbering, 
symbols and abbreviations are as described for Figure 3; 

Figure 8 is a phylogenetic analysis of the core 
region using the maximum likelihood algorithm, shown as an 
unrooted tree* Symbols are as described for Figure 2. 

Figure 9(a) shows nucleotide, and Figure 9(b) shows 
deduced amino acid sequences of HCV type 3 variants 
amplified from 5 Scottish blood donors (nos. 40, 38, 36, 
26 and 1787) in the putative NS-4 region of HCV 
(nucleotides and amino acid residues numbered as in Choo 
et al., (1991). Nucleotide codes: G: guanidine; C: 
cytidine; A: adenine; U: uridine; amino acid codes: A: 
alanine; R: arginine; N: asparagine; D: aspartic acid; C: 
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cysteine; Q: glutamine; E: glutamic acid; G: glycine; H: 
histidine; I: isoleucine; L: leucine; K: lysine; M: 
methionine; F: phenylalanine; P: proline; S: serine; T: 
threonine; W: trytophan; Y: tyrosine; V: valine.".": 
sequence not determined; difference from consensus shown 
in bold. 

Figure 10(a) shows a comparison of amino acid 
sequences between residues 1679 and 1768 (Choo et al., 
1991) of the three major variants of HCV. T16, T42, T77, 
T1801, T1825: Scottish blood donors infected with HCV 
type 1; T351: Scottish blood donor infected with HCV type 
2; T59, T940, T810: Scottish blood donors infected with 
HCV type 2; T40, T38, T36, T26 f T1787: Scottish blood 
donors infected with HCV type 3; and Figure 10(b) shows 
the derivation of consensus sequences for HCV types 3, 2 
and 1 oligopeptide series. Differences from consensus 
shown in bold. Amino acid codes: A: alanine; R: 
arginine; N: asparagine; D: aspartic acid; C: cysteine; 
Q: glutamine; E: glutamic acid; G: glycine; H: histidine; 
I: isoleucine; L: leucine; K: lysine; M: methionine; F: 
phenylalanine; P: proline; S: serine; T: threonine; W: 
tryptophan; Y: tyrosine; V: valine; " .": not determined. 

Figures 11(a) to 11(c) show amino acid sequences of 
nonameric oligopeptides used for epitope mapping, derived 
from consensus HCV type 3, type 2 and type 1 sequences 
respectively. Amino acid codes: A: alanine; R: arginine; 
N: asparagine; D: aspartic acid; C: cysteine; Q: 
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glutamine; E: glutamic acid; G: glycine; H: histidine; I: 
isoleucine; L: leucine; K: lysine; M: methionine; F: 
phenylalanine; P: proline; S: serine; T: threonine; W: 
tryptophan; Y: tyrosine; V: valine; 

Figures 12a, 12b and 12c show antibody reactivity of 
three sera from blood donors infected with HCV type 3 with 
HCV type 3-encoded oligopeptides in the antigenic region 
of NS-4 (sequences 1-82 shown in Figure 11a) . Antibody 
reactivity to oligopeptides (x-axis) , recorded as optical 
densities in the range from -01 to 0.75 (and >0.75) 
recorded on the y-axis. 

Figure 13 is a comparison of divergent HCV sequences 
with representative type 1, 2 and 3 sequences in variable 
regions of the 5 'NCR. Sequences from -255 to -246 , -215 to 
-186, -115 to -102 and -69 to -62 identical to prototype 
sequence. ».": sequence identity with HCV-1; gap 
introduced in sequences to preserve alignment; tt - w : 
sequence not determined. Origins of sequences : Eg- 1-33 
: Egypt; NL-26 : Holland; HK— 1-4 : Hong Kong; IQ-48: 
Iraq; XX-96 : xxxxx. Figures in parentheses number each 
non-identical sequence. 

Figure 14 is a phylogenetic analysis of the 5 'NCR 
region using the maximum likelihood algorithm, shown as an 
unrooted tree. Sequences 1-17 in solid circles are 
numbered as in Figure 13; previously published sequences 
numbered as in table 1 of (992) . Scottish blood donor 
sequences Eb-l-Eb-12 numbered 51-62 in hollow circles. 
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For clarity, only non-identical sequences are shown in 
tree; e.g. Sequence 1 corresponds to those found in 
samples Eg-16 and Eg-29 etc. (Figure 1) . Hollow squares 
are published sequences from Zaire; Hollow small circles 
are sequences from South Africa; Hollow small solid 
circles are sequences obtained elsewhere in the world. 

Figure 15 A/B is a comparison of nucleotide (A) and 
amino acid (B) sequences in the core region. Symbols as 
for Figure 13. Single letter amino acid codes are used; 

Figure 16 is a phylogenetic analysis of part of the 
core region using the maximum likelihood algorithm, shown 
as an unrooted tree. Sequences are numbered as in Figure 
14; sequence 30 is that of HC-J8 (Okamato et al. Virology 
188:331 -341) ; and 

Figure 17 shows cleavage patterns for A) Haelll/Rsal 
and B) ScrFI in 5 'NCR. 

I) ANALYSIS OF HEPATITIS C VIRUS AND 
PHYLOGENETIC RELATIONSHIP OF TYPES l f 2 and 3 
Introduction 

Sequence analysis of the 5' non coding region of 
hapatitis C virus (HCV) amplified from the plasma of 
individuals infected in Britain revealed the existence of 
three distinct groups of HCV, differing by 9-14% in 
nucleotide sequence. Two of the groups identified were 
similar to those of HCV variants previously termed type 1 
and type 2, while the third group appeared to represent a 
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novel virus type. Sequence comparisons were then made 
between the three virus types in other regions of the 
viral genome. In the NS-5 region, a high degree of 
nucleotide and amino acid sequence diversity was observed, 
with samples classified here as type "3" again forming a 
distinct group that was phylogenetically distinct from 
type 1 and type 2 variants. Type 3 sequences were 
similarly differentiated in the NS-3 and core regions from 
HCV type 1 sequences. The designation of virus types, 
including an observed sub-division of type 1 sequences 
into geographically distinct variants is discussed in 
relation to the new sequence data obtained in this study. 

Discussion 

Replication of nucleotide sequences by polymerase 
chain reaction (PCR) is a recently established technique, 
synthetic complementary primer sequences are hybridised to 
single-stranded DMA on either side of a genome region to 
be copied. The second strand is built up under the action 
of a heat-stable polymerase in the region between the 
primers. Heating then dissociates the two-strands and the 
replication process starts again. The PCR technique 
allows tiny amounts of polynucleotide to the amplified 
provided that there is sufficient sequence information to 
synth.es is e the primar sequences. 

The major problem associated with the use of the PCR 
to assess sequence variation using the PCR is the 
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possibility that mismatches between the primers and the 
variant sequence will prevent amplification. We have used 
several strategies to overcome this problem. For initial 
virus detection, we used primers in the 5 'NCR, which are 
reported to highly conserved amongst type 1 variants (4, 
11, 13, 16, 23, 24, 26, 33), and between Kl and K2 (23). 
Sequence analysis of the blood donors allowed the 
identification of type 1 and type 2 variants by comparison 
with published sequence data. This analysis also revealed 
the existence of a third "type" of HCV that appeared to be 
as distinct from type 1 as type 2 was (Figs. 1,2; Table 3); 
Based on our initial tentative classification, we sought 
corroboration of our findings in other (coding) and more 
variable regions of the viral genome. 

Analysis of the NS-5 region, which was based on 
several sequences of each of the three types (Figs. 3, 4; 
Table 3), confirmed the existence of 3 major groups, with 
type 3 sequences forming a relatively homogeneous group 
that was quite distinct from types 1 and 2. The proposed 
separation of type 1 sequences into PT and Kl "sub- types" 
and type 2 sequences into K2a and K2b is supported by this 
analysis, in which the single type 2 blood donor sequence 
obtained in this study appears most similar to K2b. 
Differentiation of HCV type 1 sequences into two groups is 
also clearly shown in the core (Fig. 7) and NS-3 regions 
(Fig. 5), in both cases with the type 3 sequences appearing 
considerably more distant. 
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The clustering of phylogenetically distinct groups, 
their mixed distributions in a single geographic area (1, 
7, 23, 27, 35) and our own finding of dual or triple 
infections in individual haemophiliacs all strongly 
suggest that the three types described here are distinct 
viruses rather than simply representing geographical or 
epidemiologically clustered variants of a single, highly 
variable but monophyletic group. 

our own phylogenetic analysis of the 5 'NCR reveals 
the existence of three distinct groups. This contrasts 
with analyses of coding region, where there appears to be 
a very prominent differentiation of type 1 sequences into 
two "subtypes". However, unlike type 2 and 3 variants, 
the two subtypes are geographically distinct, one sub-type 
comprising sequences obtained exclusively from Japanese 
patients, and the other comprising predominantly 
USA/European sequences (Table 2) . Indeed the only 
exception to this geographical classification is the HC-J1 
sequence (26) ; one apparent exception (Pt-1) was obtained 
from a Japanese haemophiliac treated with imported factor 
vm of USA origin (7,23), which is likely to have 
contained HCV variants corresponding to the other 
sub-type. There is insufficient sequence data to indicate 
whether the two proposed type 2 subtypes, K2a and K2b 
(7,23) represent geographically distinct variants. 

The genomic organisation of HCV corresponds to that 
of flaviviruses and pestiviruses, with a single open 
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reading frame encoding a polyprotein that is subsequently 
cleaved into structural and non-structural proteins. Weak 
sequence homologies have been detected with several other 
virus groups that have positive-sense RNA genomes 
(19,21). Although the overall degree of sequence 
dissimilarity between types 1, 2 and 3 cannot be measured 
by comparison of the small regions of sequence analysed in 
this study, a rough estimate of the extent of divergence 
in protein coding regions is given by an examination of 
the divergence of the partial core sequence. This shows 
that the difference between HCV type 1 and type 3 core 
region (approximately 10% amino acid sequence divergence) 
is comparable to that which exists between different 
serotypes of the flavivirus, tick-borne encephalitis virus 
(14%; ref.20), but lower than that which is found between 
serotypes of a mosquito borne flavivirus, dengue fever 
virus (33%), and the West Nile (WN) subgroup (28-43% 
divergence). The 5 'NCR sequences of the different members 
of WN subgroup are also considerably more diverse than 
those of the three types of HCV (=50% similarity; ref.5), 
although within each of the members e.g. Murray Valley 
encephalitis virus, the 5 'NCR is extremely well conserved 
(>95% similarity; ref.5). On the basis of these 
analogies, we speculate that the major types of HCV 
represent distinct "serotypes", each capable of human 
infection irrespective of the immune response mounted 
against other HCV types. 
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METHODS 



samples . Plasma from 18 different blood donors (E-bl 
through E-bl8) , that were repeatedly reactive on screening 
by Abbott 2nd generation enzymeimmunoassay (EIA) , and 
confirmed or indeterminate by a recombinant immunoblot 
assay (RIBA; Ortho; ref 1) were the principal samples used 
in this study. Sequences in the NS-3 region from 5 
anti-HCV positive IVDDs (abbreviated as il - i5 in ref. 
31), 5 haemophiliacs who had received non-heat treated 
clotting concentrate, and who were also anti-HCV positive 
(hl-h5) , 3 pools of 1000 donations collected in 1983 
(pl-p3), and 5 separate batches of commercially available 
non-heat treated factor VIII (fl - f5) correspond to those 
described previously (31) . 

Primers. The primers used for cDNA synthesis and 
polymerase chain reaction (PCR) are listed in Table 1. 
They were synthesised by Oswel DNA Service, Department of 
Chemistry, University of Edinburgh. 



RNA Extraction and PCR. HCV virions in 0.2-1.0 ml volumes 
of plasma were pelleted from plasma by ultracentrifugation 
at 100,000g for 2 hours at 4°C. RNA was extracted from 
the pellet as previously described (2,31). First strand 
cDNA was synthesized from 3ul of RNA sample at 42°C for 30 
min. with 7 units of avian myeloblastosis virus reverse 



WO 93/10239 



PCT/GB92/02143 



- 19 - 

transcriptase (Promega) in 20ul buffer containing 50 mM 
Tris-HCl (pH 8.0), 5mM MgCl 2 , 5 mM dithiothreitol, 50 mM 
KC1, 0.05 ug/ul BSA, 15% DMSO, 600 uM each of dATP, dCTP, 
dGTP and TTP, 1.5 uM primer and 10 U RNAsin (Promega). 

PCR was performed from 1 ul of the cDNA over 25 
cycles with each consisting of 25 sec. at 94°C, 35 sec. at 
50°C and 2.5 min. at 68°C. The extension time for the 
last cycle was increased to 9.5 min. The reactions were 
carried out with 0.4 unit Taq polymerase (Northumbria 
Biologicals Ltd.) in 20 ul buffer containing 10 mM 
Tris-HCl , pH 8.8, 50 mM KC1, 1.5 mM MgCl 2 , 0.1% Triton 
X-100, 33 UM each of dATP, dCTP, dGTP and dTTP and 0.5 uM 
of each of the outer nested primers. One ul of the 
reaction mixture was then transferred to a second tube 
containing the same medium but with the inner pair of 
nested primers, and a further 25 heat cycles were carried 
out with the same programme. The PCR products were 
electrophoresed in 3% low melting point agarose gel (IBI) 
and the fragments were detected by ethidium bromide 
staining and UV illumination. For sequence analysis, 
single molecules of cDNA were obtained at a suitable 
limiting dilution at which a Poisson distribution of 
positive and negative results was obtained (30) . 

Direct Sequencing of PCR Products . The PCR products were 
purified by glass-milk extraction ("GeneClean" ; BiolOl, 
Inc.). One quarter of the purified products was used in 
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sequencing reactions with T7 DNA polymerase (Sequenase; 
United States Biologicals) performed according to the 
manufacturer's instructions except that the reactions were 
carried out in 10% DMSO and the template DNA was heat 
denatured before primer annealing. 

Phvlooenefcic Methods . The sequences were compiled by 
version 2.0 of the programs of Staden (32) and analysed by 
programs available in the University of Wisconsin Genetics 
Computer Group sequence analysis package, version 7.0 
(6) . Phylogenetic trees were inferred using two different 
programs available in the PHYLIP package of Felsenstein 
(version 3,4 June 1991; ref.9). The program DNAML finds 
the tree of the highest likelihood (the maximum likelihood 
tree) given a particular stochastic model of molecular 
evolution and has been shown to perform well in simulation 
studies (28). In the analyses performed here the global 
(G) option was used as this searches a greater proportion 
of all possible trees. The second program used was 
NEIGHBOR which clusters (following the algorithm of Saitou 
& Nei; ref.29) a matrix of nucleotide distances previously 
estimated using the program DNADIST (which itself was set, 
using the D option, to use the same stochastic model as 
underlies DNAML in order to estimate distances corrected 
for the probabilities of multiple substitution) . In all 
cases the maximum likelihood and neighbour joining 
procedures produced congruent trees and thus only the 
former have been presented here. 
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To establish the interrelationships of the major 
types of HCV, we have separately analysed several regions 
of the viral genome that differ in sequence variability 
and evolutionary constraint. Thus the conclusions drawn 
from the sequence comparisons are not subject to spurious 
evolutionary phenomena that may affect a particular 
region. However , one problem with the analysis presented 
here was the absence of a viral sequence that was 
sufficiently distantly related to HCV to serve as an 
out-group. Thus, although we describe the 
interrelationships of different sequence variants of HCV, 
it should be stressed that we have no means of deciding 
which sequence is ancestral to the others. The trees are 
thus drawn in the less familiar un-rooted form to indicate 
this. 

RESULTS 

1) Analysis of the 5 / non-coding region . Samples were 
obtained from 18 blood donors that were repeatedly 
reactive in the Abbott 2nd Generation enzyme immunoassay 
and which were confirmed or indeterminate in the Chiron 
4-RIBA (E-bl through E-bl8, ref.10). HCV sequences 
present in stored plasma samples from each donor were 
amplified with primers corresponding to sites in the 5 'NCR 
(12 f 25) that are well conserved between all known HCV type 
1 and type 2 variants (4 , 11, 13 , 16 f 23 , 24 f 26 f 33) . 
Sequencing of the PCR product, after limiting dilution to 
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isolate single molecules of cDNA before amplification, 
allowed approximately 190 bps in the centre of the region 
to be compared with equivalent published sequences (fig.l). 

Within the sequences, constant as well as variable 
regions can be found. Six sequences from donors E-bl3 
through E-bl8 closely resembled those previously described 
as type 1 (4,11,13,16,23,24,26,33) and others resembled 
type 2(23) sequences (E-b9 through E-bl2) . However, eight 
sequences (E-bl through E-b8) were distinct from both 
types, and have been provisionally termed type 3. 
Division of the sequences into three types is supported by 
formal phylogenetic analysis using the maximum likelihood 
(fig. 2) and neighbour joining algorithms (data not shown) 
of the blood donor sequences along with previously 
published sequences (identified in table 2) . Sequence 
variability within the three groups is in each case 
considerably less than that which separates the types. No 
sequence intermediate between the three types were found. 
This tree shows that the provisionally identified type 3 
group is equally distinct from type l as is type 2. Using 
the DNAML model, the corrected distances between sequences 
within each type were in each case less than 3%. Between 
groups, they ranged from 9% (between type 1 and type 3, 
and between type 1 and 2) , to 14% between type 2 and type 
3 (table 3). 
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2) Analysis of the NS-5 Region . The nucleotide sequence 
of the NS-5 region has been found to vary signif icantly 
between the previously described Kl and K2 variants of HCV 
(7) • To investigate whether type 3 sequences were equally 
distant from the other two types in this region as well as 
in the 5 'NCR, we compared sequences from four type 3 blood 
donors (E-bl, E-b2, E-b3 and E-b7) and one type 2 donor 
(E-bl2) with previously published sequences (Fig. 3; 
Fig. 4; table 3) . 

A remarkable variation was observed between sequences 
of the three types in this region. Again, type 3 
sequences form a separate group from type 1 and type 2 in 
this region. However, unlike the 5 'NCR, there appear to 
be subdivisions within the type 1 and type 2 groups. Type 
1 sequences are split between those found in Japanese 
infected individuals (e.g. HCV-J; HCV-BK; sequence numbers 
12, 13, 16-20 in table 2) and those of U.S.A. origin 
(HCV-1, Pt-I, H77, H90; sequence numbers 1-4; fig. 4) . 
There is also some evidence for a split between type 2 
sequences, those corresponding to their previous 
designation as K2a (7) appearing distinct from type K2b 
sequences and the Scottish blood donor, E-bl2. 

Table 3 shows that the average nucleotide distances 
between the two groups of HCV type 1 sequences is 25% 
(indicated here as type la [USA] and type lb [Japanese]), 
with variation of only 4-7% within each group. The 
nucleotide sequence divergence within the two type 1 
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groups is similar to that which exists between K2a and K2b 
(table 3). However, both of these distances are 
considerably less than those which exist between type 1 
and type 2 sequences (52-62%) , and type 3 (48-49%) , and 
the distance between type 2 and type 3 sequences (53-60%) . 



3) Analysis of the NS-3 region . Amplification reactions 
were carried out using previously published primer 
sequences in the NS-3 region (37), and a pair of 
empirically derived inner primers (31) . Although these 
primers amplified HCV sequences from a high proportion of 
anti-C-100 positive sera from haemophiliacs (31) , they 
were less effective with sera from IVDDs (31) , and with 
blood donor samples (3 positive out of 15 tested? data not 
shown) . Two conserved sites in the amplified fragment 
were identified by sequence analysis of the NS-3 region 
from the haemophiliac and IVDD patients, and two new 
primers corresponding to these were specified (207, 208; 
Table 1) . The combination of 288-208 (first round) and 
290-207 (second round) primers successfully amplified 
samples from four donors infected with HCV type 3 (E-bl, 
E-b2, E-b6 and E-b7) but none of those infected with HCV 
type 2 (data not shown) . This enabled a comparison of the 
new type with our own (31) and previously published type 1 
sequences (figs. 5, 6; table 3). For clarity, only seven 
of the type 1 sequences obtained in this study (E-bl6, 
E-bl7, i3, i3, h5, h3 and hi) are shown in the tree. 
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These sequences are representative of the range of 
variation found in this region in individuals infected in 
Britain; comparison of the tree previously published (31) 
with fig. 6 shows that the former forms a very small 
component of the overall tree obtained once Japanese type 
1 and type 3 sequences are added. 

The maximum likelihood tree shows that type 1 and 
type 3 have diverged considerably from each other. As was 
found in the NS-5 region, subtypes of type 1 sequences are 
found in NS-3. Again, sequences of Japanese origin 
(HCV-J, HCV-BK and JH) are distinct from the prototype 
(FT) sequence, and those found in Scottish blood donors 
(E-bl6, E-bl7, pl-3), IVDUs (il-5) and haemophiliacs 
(hl-5) , all of which correspond to the prototype sequence 
(fig. 5). However, the average subtype difference (23%) is 
lower than those that exist between HCV-l and HCV-J with 
the four type 3 sequences (37-43%) . As reported 
previously (31) , the majority of nucleotide substitutions 
that exist between type 1 sequences are silent (i.e. do not 
affect the encoded amino acid sequence) , while numerous 
amino acid substitutions exist between type 1 and type 3 
sequences (fig. 5). The analysis of the NS-3 region 
includes the sequence of clone A (35) which was obtained 
from Japanese patients with NANB hepatitis, and which was 
reported to be distinct from existing HCV type 1 
sequences. In Fig. 6, this sequence appears to be 
distinct from both HCV type 1 and type 3, with corrected 
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sequence distances of 33-43% and 36% respectively. 
Although it is not possible to assign this sequence to any 
known group at this stage, these distances are not 
inconsistent with the hypothesis that it represents a type 

2 sequence, or an equally distinct novel HCV type. 

4) Partial Sequence of the Putat ive Core Region of HCV. 

The region encoding the putative core protein is 
comparatively well conserved in its nucleotide sequence 
between known type 1 variants, showing nucleotide and 
amino acid sequence similarities of 90-98% and 98-99% 
respectively (11, 24) . Part of the core region from the 
blood donor Ebl, who has type 3 sequences in other regions 
analysed was amplified with primers 410 and 406 and 
compared with previously published type 1 sequences 
(fig. 7, 8; table 3). This analysis confirms that the type 

3 sequence was distinct from those of type 1, and again 
there was a prominent subdivision of type 1 sequences into 
Japanese (HCV- J, HCV-BK, HC-J4, JH and J7) and 
USA/European (HCV-1, H77, H90, GM1, GM2) sequences. As 
was found in NS-3, very little amino acid sequence 
variation is found in the core regions of type 1 
sequences; almost all of the nucleotide differences 
between the two groups are at "silent" sites. By 
contrast, the type 3 sequence shows 7-8 amino acid 
substitutions on comparison with type 1 sequences. 
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TABLE 1 



SEQUENCES AND SOURCES OF PRIMERS USED FOR AMPLIFICATION OF HCV GENOME. 



Name 



Reeion 



Position Sense" Sequences 5' -3' 
of 5*base J 



Ret. 



209 


5'NCR 


8 


211 


5*NCR 


-29 


939 


5'NCR 


-297 


940 


5'NCR 


-279 



ATACTCGAGGTGCACGGTCTACGAGACCT (12) 

CAC 1 CTCGAGCACCCTATCAGGCAGT (12) 

CTGTGAGGAACTACTGTCTT (25) 

TTCACGCAGAAAGCGTCTAG (25) 



410 
-406 



CORE 
CORE 



410 
-21 



ATGTACCCCATGAGGTCGGC 
AGGTCnCOTAGACCGTGCATCATGAGCAC 



-288 


NS-3 


4951 




CCGGCATGCATGTCATGATGTAT 


190 


NS-3 


4932 




GTATTTGGTGACTGGGTGCGTC 


208 


NS-3 


4662 


+ 


TCTTGAATTTTGGGAGGGCG 1C 1 1 


207 


NS-3 


4699 


+ 


CATATAGATGCCCACTTCCTATC 



(31) 
(31) 



007 
-220 



NS-4 
NS-4 



HCV type 3H 

221 NS-4 

008 NS-4 

HCV type 2: 

281 NS-4 

509 NS-4 



5293 
5278 



4858 
4878 



4858 
4878 



AACICC«AGTATCCCACTGATGAAGTTCCACAT 
CACATGTGCTTCGCCCAGAA 



GGACCTACGCCCCTTCTATA 
TCGGTTGGGGCCTGTCCAAAATG 



GGTCCCACCCCTCTCCTGTA 
CCGCTTGGGTTCCGTTACCAACG 



HCVtvpe 1: 

253 NS-4 4858 - GGGCCAACACCCCTGCTATA 

196 NS-4 4878 + CAGACTGGGCGCCGTTCAGAATG 
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Table 1 cont. 



242 


NS-5 


8304 


555 


NS-5 


8227 


243 


NS-5 


7904 


554 


NS-5 


7935 



+ 



GGCGGAATTCCTGGTCATAGCCTCCGTGAA i7) 

CCACGACTAGATCATCTCCG 
TGGGGATCCCGTATGATACCCGCTGCTTTGA (7) 



+ CTCAACCGTCACTGAACAGGACAT 



•Position of 5'base rdarive to HCV genomic sequence in rcf. no. (4) 
"Orientation of primer sequence (+: sense: -: anti-sense) 



± Abbreviations: A: adenine c: cyridine: G: gnanidinc, T: thymidine. 

f Separate sense primers required to enable amplification of each HCV type. 
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TABLE 2 

SOURCE AND CITATION OF PREVIOUSLY PUBLISHED HCV SEQUENCES USED IN THIS STUDY 



Geographical 



No. 


Type 


Abbreviation 


Source 


Reference 


Ref. 


1 


1 


HCV-1 


U.S.A 


Choo et al.. 1991 


(4) 


2 


1 


Pt-1 


Japan 


Nakao et al.. 1991 


(23) 










Enomoto et al.. 1990 


(7) 


3.4 


1 


H77. H90 


U.S.A. 


Ogata etaL. 1991 


(24) 


5.6 


1 


GM-1. GM-2 


Germany 


FvcbsetaL. 1991 


(11) 


7 


1 


Jl 


Japan 


Han et al.. 1991 


(13) 


8 


1 


Al 


Australia 


Han et al.. 1991 


(13) 


9 


1 


SI 


S. Africa 


Han et al.. 1991 


(13) 


10 


1 


Tl 


Taiwan 


Han et al.. 1991 


(13) 


11 


1 


U 18/124 


U.S.A/Italy 


Han et al.. 1991 


U3) 


12 


1 


HCV-J 


Japan 


fCato et al.. 1990 


(16) 


13 


1 


HCV-BK 


Japan 


Takamizawa et al.. 1991 


(33) 


14.15 


1 


HC-J1. -J4 


Japan 


Okamoto et al.. 1990 


(26) 


16-20 


1 


Kl. Kl- 1-4 


Japan 


Enomoto et al.. 1990 


(7) 


21 


1 


JH 


Japan 


Kubo et al.. 1990 


(17) 


22 


1 


J7 


Japan 


Takeuchi et al.. 1990 


(34) 


23-26 


2 


K2a. K2a-1. 


Japan 


Nakao etaL. 1991 


(23) 






K2b. K2b-1 




Enomoto et al.. 1990 


(7) 


27 




Clone A 


Japan 


Tsukiyama-Kohanu 1991 


(35) 
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TABLE 3 



NUCLEOTIDE DISTANCES BETWEEN THE THREE HCV TYPES 
IN FOUR REGIONS OF THE GENOME. 



REGION 


TYPES (n 1 *) 


la 


lb 


2a 


2b 


3 




5*NCR 


1 (20) 


0.0163 


n/a* 










2(6) 


0.0869 


n/a 


0.0214 








3(8) 


0.0948 


n/a 


0.1331 


n/a 


0.0123 


CORE 


la (6) 


0.0358 












lb (5) 


0.0855 


0.0227 










3(1) 


0.1801 


0.1511 


n/d c 


n/d 


0.0000 


NS-3 


la (34) 


0.0699 












lb (3) 


0.2270 


0.0535 










3(4) 


0.3689 


0.4279 


n/d 


n/d 


0.0460 


NS-5 


la (4) 


0.0743 












lb (7) 


0.2477 


0.0372 










2a (2) 


0.6092 


0.6206 


0.0612 








2b (3) 


0.5214 


0.5732 


0.2252 


0.0655 






3(4) 


0.4754 


0.4890 


0.5983 


0.5299 


0.0322 



^number of sequences analysed 
Va : not applicable 
"in/d : not done 
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II). SEROLOGICAL REACTIVITY OF BLOOD DONORS INFECTED WITH 

THREE DIFFERENT TYPES OF HEPATITIS C VIRUS. 

HCV sequences were amplified in the 5 'non-coding 
region (5 'NCR), core, NS-3 and NS-5 regions from blood 
donors, haemophiliacs and intravenous drug abusers. 

Blood donations that were repeatedly reactive on 
screening with Abbott 2nd generation enzyme immunoassay 
(EIA) and positive or indeterminate by Ortho recombinant 
immunoblot assay (RIBA) were amplified by primers in the 
5 'NCR (reference 10) . The first fourteen PCR-positive 
blood donations (where PCR was used to amplify and thus 
detect HCV RNA present in the blood) were then typed by 
sequence analysis of the amplified region, and compared 
with their serological reactivity to a range of structural 
and non-structural peptides in two 1st generation EIAs 
(Ortho HCV ELISA; Abbott HCV EIA) and two RIBA assays 
(Ortho RIBA and Innogenetics LIA; Table 4) . The five 
donations containing HCV type 1 sequences were positive in 
both EIAs, reacted with all antigens in the Ortho RIBA 
assay, and were broadly reactive in the LIA. However, all 
but two of the sera from donors with type 2 and 3 
infections were completely negative on anti-ClOO EIA 
screening and failed to react with 5-1-1, C100 (RIBA) and 
NS4 (LIA). 
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Furthermore, some carriers of HCV type 3 variants 
reacted poorly with the C33 (NS-3) peptide in the Ortho 
RIBA, and yielded two "indeterminate" results (donor nos. 
11 and 13) . 

Thus, current tests using Ortho RIBA and (to a lesser 
extent) Innogenetics LIA tests are unable to reliably 
detect HCV-2 and HCV-3 genotypes. For reliable testing 
for all HCV types, antigens from 5-1-1, C100 and NS4 for 
each of the three types of HCV should preferably be 
included in the panel of antigens. 
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TABLE 4 



SEROLO GICAL REACTIVITY OF SERA FROM BLOOD DONORS 
INFECTED WITH THREE TYPES OF HEPATITIS C VIRUS 



Donor 
Bomber 


HCV 
genotype 


on 

CI 


ti- 
00 

At 


Ortno RXBA 
5-1-1 C100 C33 C22 


Innogeneticfl LXA 
NS4 HS5 CI* C2 C3 C4 


B-bl3 
B-bl5 
B-bl6 
B-bl7 
B-bl8 


1 


♦ 




35 4 4 4 


2S 3 1 2 1 1 
2 3 3 2 2 1 

2 3 2 3 3 - 

3 3 3 2 1 1 
3 - 2 1 1 


B-b9 
B-bXO 
B-bll 
B-bl2 


2 






13 4 


- - 3 1 1 3 
3 2 2 2 - 
3 4 2 2 3 
13 12 2 


E-bl 
E-b2 
B-b3 
B-bS 
E-b7 


3 






4 4 
2 4 
2 4 


- 13 1-3 
2 1112 

2 2 1 2 2 1 
3 12 3 

- 2 3 114 



*Ortho HCV ELBA (Recombinant CI 00-3) 

tAbbott HCV EIA (Hepatitis C Recombinant DNA Antigen) 

JCore oligopeptides. 1-4 

§Bands scored - (negative) to 4 (strong positive) according to manufacturers instructions. 
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PART HI. MAPPING OP ANTIGENIC DETERMI NANTS IN NS-4 



Introduction 

With an overall aim of improving serological 
screening assays, we have obtained sequence data from the 
antigenic region of region corresponding to clOO-3 for 
types 2 and 3 . This information was used to epitope map 
the region, to define additional immunoreactive peptides 
that could be used to improve serological anti-HCV assays. 



Methods: 

pgr and sequencing . Plasma samples from Scottish blood 
donors yielding repeatedly reactive donations on 2nd 
generation anti-HCV screening (Abbott or Ortho) , and which 
were confirmed or indeterminate on confirmatory testing by 
RIBA (Chiron) were referred to the Department of Medical 
Microbiology from the Scottish National Blood Transfusion 
Service Microbiology Reference Laboratory. HCV RNA within 
the plasma samples was extracted and amplified with 
primers in the 5 'NCR as described previously (Chan et al., 
1992) . HCV was typed by sequence analysis of the 
amplified DNA as described previously (Simmonds et al., 
1990) and by RFLP analysis. 

Five samples from different donors infected with HCV 
type 3 (nos. 40, 38, 36, 26 and 1787), four infected with 
type 2 (nos. 351, 59, 940 and 810) and five with type 1 
infection (nos. 16, 42, 77, 1801 and 1825) were amplified 
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with primers corresponding to sense and anti-sense 
sequences spanning the antigenic region of NS-4 (table 
1) . Nucleotide sequences obtained from the amplified DNA 
were compared and used to define consensus sequences for 
each HCV type. In-frame translation of the nucleotide 
sequences yielded an uninterrupted consensus amino acid 
sequence that was used to define a series of overlapping 
oligopeptides for epitope mapping. 

Epitope mapping and determination of antibody specificities 

Overlapping synthetic peptides were synthesised on 
polypropylene pins using kits commercially available from 
Cambridge Research Biochemicals Ltd. The principle of the 
addition reactions is described in refs (Geysen et al., 
1984; Geysen et al., 1985). Antibody reactions were 
carried out on pins disrupted by sonication (30 minutes) 
in 1% sodium dodecyl sulphate, 0.1% 2-mercaptoethanol, 
0.1M sodium dihydrogen orthophosphate. Pins were 
pre-coated in 1% ovalbumin, 1% bovine serum albumin, 0.1% 
Tween-20 in phosphate buffered saline (PBS) for one hour 
at room temperature. Serum or plasma was diluted 1.40 in 
PBS +0.1% Tween-20 (PBST) and incubated with the blocked 
pins at 4°C for 18 hours. After washing in 4 changes of 
PBST (10 minutes at room temperature, with agitation) , 
bound antibody was detected by incubation in a 1/20000 
dilution of affinity isolated anti-human IgG, peroxidase 
conjugate (Sigma) for one hour at room temperature. 
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Following washing (4 changes in PBST) , pins were incubated 
in a 0.05% solution of 

azino-^«3-ethyl-benzthiazodinsulphonate in 0.1 M sodium 
phosphate/sodium citrate buffer (pH 4.0) containing 0.03% 
hydrogen peroxide for 20 minutes. Optical densities were 
read at 410 run. 

RESULTS 

HCV RNA in plasma samples from five donors infected 
with HCV type 3 by sequence analysis of the 5 # NCR f and by 
RFLP were amplified in the NS-4 region using primers 
listed in Table 1. Because of the high degree of sequence 
variability in this region , it was necessary to use 
separate sense primers for the amplification of different 
HCV types. However, the anti-sense primers were in a 
highly conserved region and could be used for 
amplification of all three types. Sequence analysis was 
carried out as previously described. This gave a 
continuous sequence from position 4911 to 5271 (numbered 
as in Choo et al., 1991) (Figure 5a). Little sequence 
variability (highlighted) was observed between the four 
different donors in this region. 

The nucleotide sequences were used to deduce the 
sequence of the encoded peptide (Figure 5b) . The putative 
protein contains mainly hydrophillic residues but no 
potential sites for N-linked glycosylation. Amino acid 
sequence variability with HCV type 3 was confined to only 
five residues (Figure 5b) . However, this region differed 
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considerably from the amino acid sequences of other blood 
donors infected with HCV types 1 and 2 (T16 r 42, 77 , 1801, 
1825 , 351, 940 and 810; Figure 6a). Sequence comparison 
between the major HCV types from residues 1679 to 1769 
reveals three regions of considerable amino acid sequence 
variability. Most of the observed differences between 
types involve non-synonymous amino acid substitutions, 
particularly alternation of acidic and basic residues in 
the hydrophillic regions. These changes would be expected 
to profoundly alter the overall conformation of the 
protein, and its antigenicity. 

The consensus amino acid sequences in this region of 
types 1-3 (Figure 6b) were used to define three series of 
82 nonameric oligopeptides overlapping by eight of the 
nine residues with those before and after in the series 
(Figure 7a-c) . These were synthesised on a 12 x 8 arrays 
of polypropylene pins as described in Methods. Antibody 
reactivity to the immobilised antigens on the pins was 
determined by indirect ELISA, using an overnight 
incubation with a 1/40 dilution of test serum overnight at 
4°C, followed by washing, and detection with an anti-human 
IgG-peroxidase conjugate and appropriate substrate (see 
Methods) . 

Reactivity of an anti-HCV negative, PCR-negative 
donor, with no known risk factors for HCV infection with 
the three series of peptides was determined. No 
significant reactivity is shown with any of the 



WO93/10239 KT/GB92/02143 

- 38 - 

HCV-encoded oligopeptides. Reactivity of sera from three 
donors infected with HCV type 3 to each of the 
oligopeptides is shown in Figures 9a-9c. All three sera 
reacted with peptides ranging from No. 13 (sequence 
KPALVPDKE ; Figure 7) to Ho. 22 (sequence VLYQQYDEM) in the 
first antigenic region, although the precise peptides 
recognised varied slightly between individuals. All three 
sera reacted to varying extents with a second antigenic 
region, lying in the range from oligopeptides 32 to 42 (of 
sequence ECSQAAPYI to QAQVTAHWQ) . Weaker and more 
variable reactivity was observed to peptides 48 to 53. 
Finally, significant reactivity was also observed to 
single oligonucleotides 2 (2 of 3 samples), 61 (2 of 3), 
66 (3 of 3) , 73 (3 of 3) and 80 (2 of 3) . 

The sequences of the major antigenic regions of HCV 
type 3 differ considerably from those encoded by any of 
the type l or type 2 variants. The region bounded by 
peptides 13 to 22 shows average homologies of 50% with HCV 
type 2 variants and 67% with type 1. Between peptides 32 
to 42, there are homologies of 39% with type 2 and 58% 
with type 1 variants. Thus, although similar regions of 
each NS-4 sequence are antigenic, the actual epitopes 
differ considerably between HCV types. 



Discussion 

The NS-4 region of HCV type 3 shows considerable 
sequence divergence from other variants of HCV, that 
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exceeds that found in the core, NS-3 or NS-5 regions 
previously analysed (Chan et al f 1992). The function of 
the protein encoded by this region of the HCV genome is 
unknown, and the consequences of this variability on virus 
replication and pathogenesis are unknown. The function of 
the NS-4 region in flaviviruses and pestiviruses is also 
poorly defined. 

The degree of amino acid sequence variability, and 
the nature of the amino acid substitutions indicate that 
the major sites of antibody reactivity are also those of 
antigenic variiability. This undoubtedly underlies the 
restricted cross-reactivity of HCV type 1 NS-4 encoded 
antigens with sera from individuals infected with 
different HCV types. Serological diagnosis of infection 
is currently based entirely on recombinant or synthetic 
oligopeptide sequences derived ultimately from HCV type 1 
sequences (Choo et al., 1991). The serological response 
to infection is often very restricted in its initial 
stages, with antibody to only one of the recombinant 
antigens used for screening. Not only does this present 
difficulties with supplementary antibody tests, where 
reactivity to two HCV-encoded antigens is required for 
confirmation, but can lead to an increased probability of 
failing to detect early infection with HCV types 2 and 3. 

Table 7 relates HCV typing determined by PCR to 
results obtained using type-specific antigens (TSA) and 
shows good correlation for HCV1-3 types. 
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TABLES 



SEQUENCES OF NS-4 ENCODED ANTIGENS FOR (A) IMPROVED 

SEROLOGICAL DIAGNOSIS. AND (B) FOR SEROLOGICAL 
DISCRIMINATION OF INFECTION WITH DIFFERENT HCV TYPES 

A) 

Region 2 (1710.1728) 

ECSQAAPYIEQAQVIAHQF 

ECASKAALIEEGQRMAEML 
ECASRAALIEEGQRIAEML 



Region 2 (1710.1728) 

ECSQAAPYIEQAQVIAHQF 

ECASKAALIEEGQRMAEML 
ECASRAALIEEGQRIAEML 

ECS QHLPYIEGMLAEQF 
ECSQHLPYIEGALAEQF 



* Amino acid positions numbered as in Choo et aL. (1991). 

t Amino acid codes: A: alanine: R: arginine: N; asparagine: D: aspairic acid: C: cysteine: Q: 
glmamine: E: glutamic acid: G: glycine: H: histidine: L isoleucine: L: leucine: K: lysine: M: 
methionine: F: phenylalanine: P: proline: S: serine: T: threonine: W: tryptophan: Y: tyrosine: 
V: valine. 

? Alternative peptides, where there is variability within an HCV type- 



Type Region 1 (1691-1708)* 

3 KPALVPDKEVLYQQYDEMt 

2* RVWTPDKHLYEAFDEM 
RAVIAPDKEVLYEAFDEM 

B) 

Type Region I 0691-1708) 

3 KPALVPDKEVLYQQYDEM 

2 RWVTPDKEILYEAFDEM 
RAVIAPDKEVLYEAFDEM 

1 KPAEPDREVLYREFDEM 
KPAVIPDREVLYREFDEM 
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TABLE 6 



SEQUENCES OF OLIGONUCLEOTIDES SUITABLE FOR DIRECT DETECTION OF 
HCV TYPE 3 IN CLINICAL SPECIMENS BY POLYMERASE CHAIN REACTION 



Name Region 



007 NS-4 
220 NS-4 

Type 3% 
TS-3a NS-4 
TS-3b NS-4 

Type 2: 
TS-2a NS-4 
TS-2b NS-4 

Typeh 
TS-la NS-4 
TS-lb NS-4 



Position Pott 
of Sltase* 



5293 
5278 



5140 
5161 



5140 
5161 



5140 
5161 



+ 
+ 



+ 
+ 



AACTCGAGTATCCCACTGATGAAGTTCCACAT 
CACATGTGCTTCGCCCAGAA 



GCCGCCCCATATATCGAACA 
GCTCAGGTAATAGCCCACCA 



AAAGCCGCCCTCATTGAGGA 
GGGCAGCGGATGGCGGAGAT 



CACTTACCGTACATCGAGCA 
GGGATGATGCTCGCCGAGCA 



* Position of 5'base relative to HCV genomic sequence in Choo et aL. (1991). 

t Orientation of primer sequence (+: sense: -: anti-sense) 

t Abbreviations: A: adenine. C: cyndine: G: gnam'dine. T: thymidine. 

T Type-specific sense primers for amplification of HCV types 3. 2 and 1 variants. 
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TABLE 7 

COMPARISON OF SEROLOGICAL TYPING BY HCV-TSA WITH PCR 



POP 



Number TYPE-SPECIFIC ANTIBODY 

1 2 3 1+2 1+3 2+3 NTS* NR° 



1 57 63 ... 1 - 3 3 

2 12 11 - - ' 110 

3 47 1 - 45 - 2 - 4 4 



Haem* 27 U 



'Genotype ofHCV sequences amplified by PCR and typed by RFLP (McOmish etaL 1992) 

* NTS : No type-specific antibody detected 
e NR : non-reactive with NS-4 peptides 

* Samples from HCV-infected haemophiliacs, nn-typed by PCR. 
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PART IV IDENTIFICATION OF HCV TYPE -4 



PCT/GB92/02143 



Introduction 

Investigations were carried out on sequence 
variations in the 5' non-coding region (5 'NCR) of HCV 
samples from a variety of worldwide geographical locations 
(Figure 13), and also in the core region (Figures 15A and 
15B) . Phylogenetic analysis (Figures 14 and 16) revealed 
a new distinct HCV type which we refer to herein as HCV-4. 

Methods 

Samples , RNA was extracted from plasma samples that were 
repeatedly reactive on second generation screening assays 
for HCV, and which were either confirmed (significant 
reactivity with two or more antigens in the Chiron 
recombinant immunoblot assay; Chiron Corporation, 
Emeryville, California, USA) or indeterminate (reactive 
with only one antigen) from blood donors and patients with 
NANBH. Most of the samples containing sequences that 
differed substantially from known HCV types came from 
Egypt (EG 1-33). Others came from Holland (NL-26) , Hong 
Kong (HK 1-4), Iraq (IQ-48) and XX (xx-(6) . 

Sequence determination . HCV sequences were reverse 
transcribed and amplified with primers matching conserved 
regions in the 5 'NCR as previously described [l]. For 
analysis of the core region, RNA was reverse transcribed 
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using a primer of sequence CA (T/C) GT (A/G) AGGGTATCGATGAC 
(5' base: xxx, numbered as in [20]). cDKA was amplified 
using this primer and a primer in the 5 'NCR of sequence 
ACTGCCTGATAGGGTGCTTGCGAG (5' base: -54) . The second PCR 
used primers of sequences AGGTCTCGTAGACCGTGCATCATG (5' 
base: -21) and TTGCG (G/T/C) GACCT (A/T) CGCCGGGGGTC 
(5'base: xxx) . Amplified DNA in both regions was directly 
sequenced as described previously (reference la) . 

sequence analysis. Sequences were aligned using the 
CUJSTAL program in the University of Wisconsin GCG package 
(reference 6) . Phylogenetic trees were constructed by the 
DNAML program in the PHYLIP package of Felsenstein 
(version 3.4, June 1991; (reference 9), using the global 
option. RNA secondary structures in the 5 'NCR of 4 
representative HCV variants (refs) were predicted using 
the program FOLD. Three predictions were made from each 
sequence between nucleotides -341 to -1, -341 to +300, and 
-341 to +900 to allow for possible long range interactions. 
Comparison of the predicted conformations for each 
sequence over the different lengths showed that only 
relatively small scale features, such as the stem/loop 
analysed in the Results were at all conserved (data not 
shown) . 

All sequences reported in this part have been 
submitted to GenBank. 
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Results 

Divergent 5 'NCR sequences. Several sequences in the 5'NC 
region detected in samples of blood donors from Saudi 
Arabia, Holland and Hong Kong, and from NANBH patients in 
Iraq and xxx differed substantially from those found in 
Scottish blood donors and those reported elsewhere (Figure 
13) . Instead of showing the well characterised nucleotide 
substitutions that distinguish HCV types 1, 2 and 3 from 
each other, a new set of sequence differences were 
observed in the new variants that appeared to place them 
outside the existing system of virus classification. This 
can be more simply represented by reconstructing a 
phylogeny of the sequences and presenting the results as 
an evolutionary tree (Figure 14) . This analysis confirms 
that sequences 1-10 cluster separately from the variants 
previously typed as 1,2 and 3. For convenience we will 
refer to sequences within this new group as type 
HCV type 4. Mean distances within type 4 and between type 
4 and the other HCV types in the 5 'NCR were comparable to 
those previously described for type 1-3. Although 
sequences within type 4 are relatively closely grouped, 
sequences 11, 12 and 13 differ considerably from any of 
the known types. 

* Using this phylogenetic tree, it can be seen that the 

majority of previously published 5 'NCR sequences can be 
readily identified as types 1, 2 or 3. Furthermore, 
almost all of the sequences from Zaire (shown as hollow 
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squares) cluster closely within type 4, suggesting a wider 
distribution in Africa. However, a further complication 
is that three identical sequences obtained from South 
African patients appeared distinct from both the type 1 
and the type 4 group, and may represent yet another HCV 
type. 

RNA from three representative type 4 variants (Eg 29, 
33, 21; corresponding to 5 'NCR sequences nos. 1-3) was 
amplified using primers in the core region of HCV 
polyprotein. All three sequences differed considerably at 
both the nucleotide and amino acid level from HCV types 1 
to 3 (Figure 15A/B) . Phylogenetic analysis of these 
sequences and those previously analysed indicated that 
they formed a separate, relatively homogeneous group 
distinct from the other types (Figure 16) . Reconstructed 
nucleotide distances between type 4 and types 1-3 were 
comparable to those that exist between the three known HCV 
types of HCV. Although most of the nucleotide sequence 
differences were silent, there were between 4 and 9 amino 
acid differences between the new variants and other types. 
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PART V HCV TYPING 
Introduction 

In view of the sequence variations between HCV types 
1,2,3 and 4 differences in restriction enzyme cleavage sites 
exist, leading to different endonuclease cleavage patterns. 
This technique was used to identify HCV genotypes in blood 
samples from a variety of sources worldwide. 

(A) Typing of HCV1-3 
METHODS 

Serum Samples ; Samples from blood donors in six countries, 
Scotland, Finland, Netherlands, Hong Kong and Australia and 
Japan, were available from routine 2nd Generation anti-HCV 
ELISA screening (Ortho or Abbott) . Donor samples that were 
repeatedly reactive in the above tests were further 
investigated using a supplementary test (Ortho RIBA: 
Finland, Netherlands, Australia, Egypt, Abbott Matrix: Hong 
Kong) or samples were titred for anti-HCV by ELISA (Japan) . 
Samples that were positive (significant reactivity with two 
or more HCV antigens (1+ to 4+) or indeterminate (reactivity 
with one antigen only) in the RIBA test or had a titre of > 
X 4096 by ELISA (Japan only) were tested for viral RNA by 
Polymerase Chain Reaction (PCR) . 

RNA PGR ; PCR for the detection of HCV RNA was carried out as 
previously described by Chan et al (reference la) using 
primers in the 5 'non-coding region (5 'NCR) in a nested PCR, 
with primers 209/939 and 211/940(5) in first and second 
reactions respectively. 
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HCV TYPING 

The existence of relatively conserved patterns of 
substitutions in the 5 'NCR that are characteristic of 
different HCV types provide useful signature sequences for 
identification of HCV genotypes. Having compared large 
numbers of different HCV type 1, 2 and 3 sequences, we 
developed a method that differentiated HCV types 1-3 by 
restriction endonuclease cleavage of amplified DNA. 
However, the 19 type 4 sequences would appear as type 1 
(electrophoretic types Aa and Ab) , and for concurrent 
studies it has been necessary to modify the conditions to 
identify the new HCV type. All type 4 sequences showed a 
T — >C change at position -167 that creates a novel Hinfl 
site that is absent in all type 1 (and type 2) sequences. 
In combination with ScrFI, and Haelll/Rsal, it has now 
proved possible to identify the new type reliably in 
numerous countries in the Middle East and elsewhere. 



RESULTS 

The results are summarised in Table 8 for HCV types 
1, 2 and 3. The Egyption samples gave abberant 
restriction patterns on the single ScrFI digest and were 
identified as type 4. 
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TABLE 8 



PREVALENCE OF HCV TYPES IN DIFFERENT COUNTRIES 



COUNTRY 



HCV TYPES (%) 
HCV-1 HCV- 2 HCV- 3 



Scotland 



86(51%) 21(13%) 60(36%) 



Finland 



3(25%) 5(42%) 4(33%) 



Netherlands 



18(60%) 7(23%) 



5(17%) 



Hong Kong 



22(63%) 0(0%) 



0(0%) 



Australia 



13(57%) 3(13%) 7(30%) 



Japan 



31(77%) 9(23%) 



0(0%) 



Egypt 



0(0%) 0(0%) 



0(0%) 
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(B) MODIFICATION OF PCR-BASED TYPING ASSAY TO DETECT 
tttprpttow WITH HCV TYPE 4 IN rr .TKTCAI. SPECIMENS 



Methods 

gy^rtion of SNA . ENA was extracted from 100 ul aliquots 
of plasma of non-A, non-B patients by addition of 1 ml 
RNAzol solution (2M guanidinium thiocyanate, 12.5 mM 
sodium citrate [pH7.0], 0.25% w/v N-lauroylsarcosine, 
0.05M 2-mercaptoethanol, lOOmM sodium acetate [pH 4.0], 
50% w/v water saturated phenol) as previously described 
(Chomczynski et al. 1987) , and mixed until precipitate 
dissolved. After addition of 100 ul chloroform, each 
sample was spun for 5 minutes at 14000 x g and the aqueous 
phase re-extracted with 0.5 ml chloroform. UNA was 
precipitated by addition of an equal volume of isopropanol 
and incubation at -20°C for at least 1 hour. An ENA 
pellet was produced by centrifugation at 14000 x g for 15 
minutes at 4°C, washed in 1ml 70% cold ethanol solution, 
dried and resuspended in 20 ul diethylpyrocarbonate 
treated distilled water. Of the 100 directly extracted 
samples, a total of 19 were PCR-negative (see below) . Two 
ml volumes of the negative samples were ultracentrifuged 
at 200 000 x g for 2 hours and the pellet re-extracted as 
described above. Extraction from the larger volume of 
plasma yielded an additional 3 positive samples (numbers 
66, 80, 85). 
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PGR and typing . RNA was reverse transcribed with primer 
940 and cDNA amplified in a two stage nested PCR reaction 
with primers 940/939, followed by 209/211 as previously 
described (Chan et al. 1992). PCR product was 
radiolabelled with [ 35 S]-dATP analysed by restriction 
endonuclease cleavage (HcOmish et al. Transfusion, 
32: no. 11 1992). Samples were cleaved with ScrFI and a 
combination of Haelll/Rsal in two separate reactions to 
identify HCV types 1/4 f 2, 3. Figure 17 shows 
endonuclease cleavage patterns. HCV types 1 and 4 were 
differentiated by a third reaction with Hinfl (see 
Results) • Two samples yielded restriction patterns that 
were different from those of the four known types of HCV 
and were analysed further by direct sequence analysis of 
the amplified DNA (Chan et al. 1992). These two samples 
contained 5 'NCR sequences distinct from those of known HCV 
types and currently remain unclassified. 

RESULTS 

Modification of RFLP method to identify HCV type 4. 
Previous sequence analysis in the 5 'NCR of HCV amplified 
from plasma of Egyptian blood donors revealed a relatively 
homogeneous group of novel sequence variants in both the 
5 'NCR and core region which were as distinct from HCV 
types 1, 2 and 3 as these latter types were from each 
other (see previous submission) . This new group was 
designated as HCV type 4. 
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Comparison of cleavage patterns of type 4 sequences 
with those of type RFLP analysis of the previously 
identified type 4 sequences produced a distribution of 
electropherotypes with ScrFI and Haelll/Rsal similar to 
that HCV type 1 (Table 9) . Type 1 sequences yielded 9 
patterns of aA/B, 35 of bA/B and 1 bC. With these enzymes 
alone, type 4 sequences were thus indistinguishable from 
type l (14 aA/B, 4 bA/B) . However, type 1 and type 4 
sequences consistently differ in the number of Hinfl 
sites. All 18 type 4 sequences contain one or two 
potential cleavage sites (producing patterns band c; table 
5) while none are found in any of the 45 type 1 sequences 
analysed (pattern a) . One of the type 4 sequences was 
further differentiated from type 1 and other HCV types by 
the loss of a restriction site for Rsal, leading to a new 
pattern of bands designated h (44, 172, 9, 26; first 
column. Table 9) . Finally, a single sequence, EG-28 lost 
two sites to produce bands of 216, 9, and 26 bps (pattern 
i; Table 9) . This sequence was distinct from that of any 
of the known HCV types (including type 4) and is shown in 
the table in the column labelled U (unclassified) 

Typing of study subjects. RNA was extracted from 100 
samples of patients with NANB hepatitis and amplified with 
primers in the 5 'NCR. Of these, 84 were PCR positive, and 
enabled HCV typing to be carried out by RFLP. This was 



WO 93/10239 



PCT/GB92/02143 



- 53 - 

initially carried out with Haelll/Rsal and ScrFl, and 
allowed the identification of 10 type 2 and 10 type 3 
variants (Table 10) . Samples showing electrophoretic 
patterns aA/B or bA/B were further analysed by cleavage 
with hinfl, yielding 38 samples with pattern a, thus 
identified as type 1, 22 with pattern b and 2 with pattern 
c, both identified as type 4. Finally, two samples showed 
the unusual cleavage patterns h and i with Haelll/Rsal and 
pattern b with Hinfl, and were therefore directly 
sequenced. These two sequences were similar to each other 
but were unlike any of the known HCV types, and also 
distinct from EG-28, the other sequence showing pattern i 
with Haelll/Rsal (Table 10) • As they cannot be currently 
classified, they will be referred to as type U. 
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TABLE 9 

PREDICTED CLEAVAGE PATTERNS OF PUBLISHED 5'NCR SEQUENCES 
OF HCV TYPES 1. 2. 3 AND 4 WITH RsaUHaeUL ScrFl AND Hirfi 



Predicted 


cleavage p 


altera 3 




HCV type 






fffifTTT 














/JtazT 

/AJwl 


ScrFl 


Hiafl 


1 


2 3 


4 


V 


d. 


A/B 


a c 


9 




- 


- 


b 


A/B 


a 


35 


- 


- 


- 


b 


C 


a 


1 








a 


A/B 


b d 




_ _ 


13 


- 


a 


A/B 




- 


- 


1 


- 


b 


A3 


b 






4 




c 


D 


a 


- 


5 


- 


- 


• 

d 


L/ 


a 




1 






d 


E 


a 




2 






e 


D 


a 




1 






e 


E 


a 




1 






f 


G 


b 




14 






f 


G 


c 




1 






£ 


G 


b 




8 






1/ 


A/B 


b 






i 






A/B 


b 








1 



• Cleavage patterns designated fat HaeSURsdl and Scrfl as described previously (McOmish 
eraL 1992). 

• Cleavage panr™ of an HCV variant of undesigned type 
c Pastern a : uncieaved by Hinfi 

d P attern b : DNA cleaved to generate two fragments of sizes 107 and 142 bps (in order 5*-> 
3') 

• pattern c : DNA cleaved to generate three fragments of 56. 51 and 142 bps 

f New cleavage pattern for HaeUHRsaL designated h (bands of 44 bps. 172 bps. 9 bps. 26 bps) 
s N ew cleavage pattern for HaeHURsdL designated i (216 bps. 9 bps. 26 bps) 
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TABLE 10 

IDENTIFICATION OF HCV TYPES 1-4 ,N STUDY SUBJECTS BY RFLP 
ANALYSIS OF 5' NCR SEQUENCES WITH RsaUHaeUl. ScrFl AND Hinfl. 



Observed i 


deavage p 


altera 


Inferred HCV type 




Haem 












U* 


IRial 


ScrFl 


ttUlJl 


1 2 


3 


4 


a 


A/B 


a 


2 




_ 


- 


b 


A/B 


a 


36 


- 


- 




a 


A/B 


b 


- 


- 


16 


- 


a 


A/B 


c 






2 




b 




b 






6 


- 


c 


D 


n.d. 


7 








d 


E 


nA 


3 








f 


G 


n.d 




7 






g 


G 


n.d. 




3 






h 


A/B 


b 








1 


• 

i 


a/b 


b 








1 






TOTALS 


38 10 


10 


24 


2 



* Two samples yielded unusual restriction patterns with HaeWRsal (h, i). Sequence analysis 
of the 5'NCR placed them outside existing HCV classification (samples IQ-48, EG-96). 
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PART VI Expression and Assay etc. Techniques 

The present invention also provides expression 
vectors containing the DNA sequences as herein defined, 
which vectors being capable, in an appropriate host, of 
expressing the DNA sequence to produce the peptides as 

defined herein. 

The expression vector normally contains control 
elements of DNA that effect expression of the DNA sequence 
in an appropriate host. These elements may vary according 
to the host but usually include a promoter, ribosome 
binding site, translational start and stop sites, and a 
transcriptional termination site. Examples of such 
vectors include plasmids and viruses. Expression vectors 
of the present invention encompass both extrachromosomal 
vectors and vectors that are integrated into the host 
cell's chromosome. For use in E.coli, the expression 
vector may contain the DNA sequence of the present 
invention optionally as a fusion linked to either the 5'- 
or 3 '-end of the DNA sequence encoding, for example, 
B-galactosidase or to the 3 '-end of the DNA sequence 
encoding, for example, the trp E gene. For use in the 
insect baculovirus (AcNFV) system, the DNA sequence is 
optionally fused to the polyhedrin coding sequence. 

The present invention also provides a host cell 
transformed with expression vectors as herein defined. 

Examples of host cells of use with the present 
invention include prokaryotic and eukaryotic cells, such 
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as bacterial, yeast, mammalian and insect cells. 
Particular examples of such cells are E.coli, 
S.cerevisiae, P»pastoris, Chinese hamster ovary and mouse 
cells, and Spodoptera frugiperda and Tricoplusia ni. The 
choice of host cell may depend on a number of factors but, 
if post-translational modification of the HCV viral 
peptide is important, then an eukaryotic host would be 
preferred. 

The present invention also provides a process for 
preparing a peptide as defined herein which comprises 
isolating the DNA sequence, as herein defined, from the 
HCV genome, or synthesising DNA sequence encoding the 
peptides as defined herein, or generating a DNA sequence 
encoding the peptide, inserting the DNA sequence into an 
expression vector such that it is capable, in an 
appropriate host, of being expressed, transforming host 
cells with the expression vector, culturing the 
transformed host cells, and isolating the peptide. 

The DNA sequence encoding the peptide may be 
synthesised using standard procedures (Gait, 
Oligonucleotide Synthesis: A Practical Approach , 1984, 
Oxford, IRL Press) . 

The desired DNA sequence obtained as described above 
may be inserted into an expression vector using known and 
standard techniques. The expression vector is normally 
cut using restriction enzymes and the DNA sequence 
inserted using blunt-end or staggered-end ligation. The 
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cut is usually made at a restriction site in a convenient 
position in the expression vector such that, once 
inserted, the DNA sequences are under the control of the 
functional elements of DNA that effect its expression. 

Transformation of an host cell may be carried out 
using standard techniques. Some phenotypic marker is 
usually employed to distinguish between the transformants 
that have successfully taken up the expression vector and 
those that have not. Culturing of the transformed host 
cell and isolation of the peptide as required may also be 
carried out using standard techniques. 

The peptides of the present invention may be prepared 
by synthetic methods or by recombinant DNA technology. 
The peptides are preferably synthesized using automatic 
synthesizers. 

Antibody specific to a peptide of the present 
invention can be raised using the peptide. The antibody 
may be polyclonal or monoclonal. The antibody may be used 
in quality control testing of batches of the peptides; 
purification of a peptide or viral lysate; epitope 
mapping; when labelled, as a conjugate in a competitive 
type assay, for antibody detection; and in antigen 
detection assays. 

Polyclonal antibody against a peptide of the present 
invention may be obtained by injecting a peptide, 
optionally coupled to a carrier to promote an immune 
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response, into a mammalian host, such as a mouse, rat, 
sheep or rabbit, and recovering the antibody thus 
produced. The peptide is generally administered in the 
form of an injectable formulation in which the peptide is 
admixed with a physiologically acceptable diluent. 
Adjuvants, such as Freund's complete adjuvant (FCA) or 
Freund's incomplete adjuvant (FIA) , may be included in the 
formulation. The formulation is normally injected into 
the host over a suitable period of time, plasma samples 
being taken at appropriate intervals for assay for 
anti-HCV viral antibody. When an appropriate level of 
activity is obtained, the host is bled. Antibody is then 
extracted and purified from the blood plasma using 
standard procedures, for example, by protein A or 
ion-exchange chromatography. 

Monoclonal antibody against a peptide of the present 
invention may be obtained by fusing cells of an 
immortalising cell line with cells which produce antibody 
against the viral or topographically related peptide, and 
culturing the fused immortalised cell line. Typically, a 
non-human mammalian host, such as a mouse or rat, is 
inoculated with the peptide. After sufficient time has 
elapsed for the host to mount an antibody response, 
antibody producing cells, such as the splenocytes, are 
removed. Cells of an immortalising cell line, such as a 
mouse or rat myeloma cell line, are fused with the 
antibody producing cells and the resulting fusions 
screened to identify a cell line, such as a hybridoma, 
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that secretes the desired monoclonal antibody. The fused 
cell line may be cultured and the monoclonal antibody 
purified from the culture media in a similar manner to the 
purification of polyclonal antibody. 

Diagnostic assays based upon the present invention 
may be used to determine the presence or absence of HCV 
infection. They may also be used to monitor treatment of 
such infection, for example in interferon therapy. 

In an assay for the diagnosis of viral infection, 
there are basically three distinct approaches that can be 
adopted involving the detection of viral nucleic acid, 
viral antigen or viral antibody. Viral nucleic acid is 
generally regarded as the best indicator of the presence 
of the virus itself and would identify materials likely to 
be infectious. However, the detection of nucleic acid is 
not usually as straightforward as the detection of 
antigens or antibodies since the level of target can be 
very low. Viral antigen is used as a marker for the 
presence of virus and as an indicator of infectivity. 
Depending upon the virus, the amount of antigen present in 
a sample can be very low and difficult to detect. 
Antibody detection is relatively straightforward because, 
in effect, the host immune system is amplifying the 
response to an infection by producing large amounts of 
circulating antibody. The nature of the antibody response 
can often be clinically useful, for example IgM rather 
than IgG class antibodies are indicative of a recent 
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infection, or the response to a particular viral antigen 
may be associated with clearance of the virus. Thus the 
exact approach adopted for the diagnosis of a viral 
infection depends upon the particular circumstances and 
the information sought. In the case of HCV, a diagnostic 
assay may embody any one of these three approaches. 

In an assay for the diagnosis of HCV involving 
detection of viral nucleic acid, the method may comprise 
hybridising viral RNA present in a test sample, or cDNA 
synthesised from such viral RNA, with a DNA sequence 
corresponding to the nucleotide sequences of the present 
invention or encoding a peptide of the invention, and 
screening the resulting nucleic acid hybrids to identify 
any HCV viral nucleic acid. The application of this 
method is usually restricted to a test sample of an 
appropriate tissue, such as a liver biopsy, in which the 
viral RNA is likely to be present at a high level. The 
DNA sequence corresponding to a nucleotide sequence of the 
present invention or encoding a peptide of the invention 
may take the form of an oligonucleotide or a cDNA sequence 
optionally contained within a plasmid. Screening of the 
nucleic acid hybrids is preferably carried out by using a 
labelled DNA sequence. Preferably the peptide of the 
present invention is part of an oligonucleotide wherein 
the label is situated at a sufficient distance from the 
peptide so that binding of the peptide to the viral 
nucleic acid is not interfered with by virtue of the label 
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being too close to the binding site. One or more 
additional rounds of screening of one kind or another may 
be carried out to characterise further the hybrids and 
thus identify any HCV viral nucleic acid. The steps of 
hybridisation and screening are carried out in accordance 
with procedures known in the art. 

The present invention also provides a test kit for 
the detection of HCV viral nucleic acid, which comprises 

i) a labelled oligonucleotide comprising a DMA sequence 
of the present invention or encoding a peptide of the 
present invention; and 

ii) washing solutions, reaction buffers and a substrate, 
if the label is an enzyme. 

Advantageously, the test kit also contains a positive 
control sample to facilitate in the identification of 
viral nucleic acid. 

In an assay for the diagnosis of HCV involving 
detection of viral antigen or antibody, the method may 
comprise contacting a test sample with a peptide of the 
present invention or a polyclonal or monoclonal antibody 
against the peptide and determining whether there is any 
antigen-antibody binding contained within the test sample. 
For this purpose, a test kit may be provided comprising a 
peptide, as defined herein, or a polyclonal or monoclonal 
antibody thereto and means for determining whether there 
is any binding with antibody or antigen respectively 
contained in the test sample. The test sample may be 
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taken from any of the appropriate tissues and 
physiological fluids mentioned above for the detection of 
viral nucleic acid. If a physiological fluid is obtained, 
it may optionally be concentrated for any viral antigen or 
antibody present. 

A variety of assay formats may be employed. The 
peptide can be used to capture selectively antibody 
against HCV from solution, to label selectively the 
antibody already captured, or both to capture and label 
the antibody. In addition, the peptide may be used in a 
variety of homogeneous assay formats in which the antibody 
reactive with the peptide is detected in solution with no 
separation of phases. 

The types of assay in which the peptide is used to 
capture antibody from solution involve immobilization of 
the peptide on to a solid surface. This surface should be 
capable of being washed in some way. Examples of suitable 
surfaces include polymers or various types (moulded into 
microtitre wells; beads; dipsticks of various types; 
aspiration tips; electrodes; and optical devices), 
particles (for example latex; stabilized red blood cells; 
bacterial or fungal cells; spores; gold or other metallic 
or metal-containing sols; and proteinaceous colloids) with 
the usual size of the particle being from 0.02 to 5 
microns, membranes (for example of nitrocellulose; paper; 
cellulose acetate; and high porosity/high surface area 
membranes of an organic or inorganic material) . 
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The attachment of the peptide to the surface can be 
by passive adsoption from a solution of optimum 
composition which may include surfactants, solvents, salts 
and/or chaotropes; or by active chemical bonding. Active 
bonding may be through a variety of reactive or 
activatible functional groups which may be exposed on the 
surface (for example condensing agents? active acid 
esters, halides and anhydrides; amino, hydroxyl, or 
carboxyl groups; sulphydryl groups; carbonyl groups; 
diazo groups; or unsaturated groups). Optionally, the 
active bonding may be through a protein (itself attached 
to the surface passively or through active bonding) , such 
as albumin or casein, to which the viral peptide may be 
chemically bonded by any of a variety of methods. The use 
of a protein in this way may confer advantages because of 
isoelectric point, charge, hydrophilicity or other 
physico-chemical property. The viral peptide may also be 
attached to the surface (usually but not necessarily a 
membrane) following electrophorectic separation of a 
reaction mixture, such as immunoprecipitation. 

After contacting (reacting) the surface bearing the 
peptide with a test sample, allowing time for reaction, 
and, where necessary, removing the excess of the sample by 
any of a variety of means, (such as washing, 
centrifugation, filtration, magnetism or capilliary 
action) the captured antibody is detected by any means 
which will give a detectable signal. For example, this 
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may be achieved by use of labelled molecule or particle as 
described above which will react with the captured 
antibody (for example protein A or protein G and the like; 
anti-species or anti-immunoglobul in-sub-type; rheumatoid 
factor; or antibody to the peptide, used in a competitive 
or blocking fashion) , or any molecule containing an 
epitope contained in the peptide. 

The detectable signal may be optical or radioactive 
or physico-chemical and may be provided directly by 
labelling the molecule or particle with, for example, a 
dye, radiolabel, electroactive species, magnetically 
resonant species or fluorophore, or indirectly by 
labelling the molecule or particle with an enzyme itself 
capable of giving rise to a measurable change of any 
sort. Alternatively the detectable signal may be obtained 
using, for example, agglutination, or through a 
diffraction or birefringent effect if the surface is in 
the form of particles. 

Assays in which a peptide itself is used to label an 
already captured antibody require some form of labelling 
of the peptide which will allow it to be detected. The 
labelling may be direct by chemically or passively 
attaching for example a radio label, magnetic resonant 
species, particle or enzyme label to the peptide; or 
indirect by attaching any form of label to a molecule 
which will itself react with the peptide. The chemistry 
of bonding a label to the peptide can be directly through 
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a moiety already present in the peptide, such as an amino 
group, or through an intermediate moiety, such as a 
maleimide group. Capture of the antibody may be on any of 
the surfaces already mentioned by any reagent including 
passive or activated adsorption which will result in 
specific antibody or immune complexes being bound. In 
particular, capture of the antibody could be by 
anti-species or anti-immunoglobulin-sub-type, by 
rheumatoid factor, proteins A, G and the like, or by any 
molecule containing an epitope contained in the peptide. 

The labelled peptide may be used in a competitive 
binding fashion in which its binding to any specific 
molecule on any of the surfaces exemplified above is 
blocked by antigen in the sample. Alternatively, it may 
be used in a non-competitive fashion in which antigen in 
the sample is bound specifically or non-specif ically to 
any of the surfaces above and is also bound to a specific 
bi- or polyvalent molecule (e.g. an antibody) with the 
remaining valencies being used to capture the labelled 
peptide. 

Often in homogeneous assays the peptide and an 
antibody are separately labelled so that, when the 
antibody reacts with the recombinant peptide in free 
solution, the two labels interact to allow, for example, 
non-radiative transfer of energy captured by one label to 
the other label with appropriate detection of the excited 
second label or quenched first label (e.g. by fluorimetry, 
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magnetic resonance or enzyme measurement) . Addition of 
either viral peptide or antibody in a sample results in 
restriction of the interaction of the labelled pair and 
thus in a different level of signal in the detector. 

A suitable assay format for detecting HCV antibody is 
the direct sandwich enzyme immunoassay (EIA) format. A 
peptide is coated onto microtitre veils. A test sample 
and a peptide to which an enzyme is coupled are added 
simultaneously. Any HCV antibody present in the test 
sample binds both to the peptide coating the well and to 
the enzyme-coupled peptide. Typically, the same peptide 
are used on both sides of the sandwich. After washing, 
bound enzyme is detected using a specific substrate 
involving a colour change. A test kit for use in such an 
EIA comprises: 

(1) a peptide, as herein defined labelled with an enzyme; 

(2) a substrate for the enzyme; 

(3) means providing a surface on which a peptide is 
immobilised; and 

(4) optionally, washing solutions and/or buffers. 

It is also possible to use IgG/IgM antibody capture 
ELISA wherein an antihuman antibody is coated onto 
microlitre wells, a test sample is added to the well. Any 
IgG or IgM antibody present in the test sample will then 
bind to the anti-human antibody. A peptide of the present 
invention, which has been labelled, is added to the well 
and the peptide will bind to any IgG or IgM antibody which 
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has resulted due to infection by HCV. The IgG or IgM 
antibody can be visualized by virtue of the label on the 
peptide. 

It can thus be seen that the peptides of the present 
invention may be used for the detection of HCV infection 
in many formats, namely as free peptides, in assays 
including classic ELISA, competition ELISA, membrane bound 
EIA and immunoprecipitation. Peptide conjugates may be 
used in amplified assays and IgG/ IgM antibody capture 
ELISA. 

An assay of the present invention may be used, for 
example, for screening donated blood or for clinical 
purposes, for example, in the detection and monitoring of 
HCV infections. For screening purposes, the preferred 
assay formats are those that can be automated, in 
particular, the microtitre plate format and the bead 
format. For clinical purposes, in addition to such 
formats, those suitable for smaller-scale or for single 
use, for example, latex assays, may also be used. For 
confirmatory assays in screening procedures, antigens may 
be presented on a strip suitable for use in Western or 
other immunoblotting tests. 

As indicated above, assays used currently to detect 
the presence of anti-HCV antibodies in test samples, 
particularly in screening donated blood, utilise antigenic 
peptides obtained from HIV type 1 only and, as 
demonstrated herein, such antigens do not reliably detect 
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other HCV genotypes. Accordingly, it is clearly desirable 
to supplement testing for HIV-1 with testing for all other 
genotypes, for example, types 2, 3 and 4, and also any 
further genotypes that may be discovered. 

To test for a spectrum of genotypes, there may be 
provided a series of assay means each comprising one or 
more antigenic peptides from one genotype of HCV, for 
example, a series of wells in a microtitre plate, or an 
equivalent series using the bead format. Such an assay 
format may be used to determine the genotype of HCV 
present in a sample. Alternatively, or in addition, an 
assay means may comprise antigenic peptides from more than 
one genotype, for example, a microwell or bead may be 
coated with peptides from more than one genotype. 

It has been found advantageous to use more than one 
HCV antigen for testing, in particular, a combination 
comprising at least one antigenic peptide derived from the 
structural region of the genome and at least one antigenic 
peptide derived from the non-structural region, especially 
a combination of a core antigen and at least one antigen 
selected from the NS3, NS4 and NS5 regions. The wells or 
beads may be coated with the antigens individually. It 
has been found advantageous, however, to fuse two or more 
antigenic peptides as a single polypeptide, preferably as 
a recombinant fusion polypeptide. Advantages of such an 
approach are that the individual antigens can be combined 
in a fixed, predetermined ratio (usually equimolar) and 
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that only a single polypeptide needs to be produced, 
purified and characterised. One or more such fusion 
polypeptides may be used in an assay, if desired in 
addition to one or more unfused peptides. It will be 
appreciated that there are many possible combinations of 
antigens in a fusion polypeptide, for example, a fusion 
polypeptide may comprise a desired range of antigens from 
one serotype only, or may comprise antigens from more than 
one serotype. The antigenic peptides from serotypes 2, 3 
and 4 are preferably those described herein. 

To obtain a polypeptide comprising multiple peptide 
antigens, it is preferred to fuse the individual coding 
sequences into a single open reading frame. The fusion 
should, of course, be carried out in such a manner that 
the antigenic activity of each component peptide is not 
significantly compromised by its position relative to 
another peptide. Particular regard should of course be 
had for the nature of the sequences at the actual junction 
between the peptides. The resulting coding sequence can 
be expressed, for example, as described above in relation 
to recombinant peptides in general. The methods by which 
such a fusion polypeptide can be obtained are known in the 
art, and the production of a recombinant fusion 
polypeptide comprising multiple antigens of a strain of 
HCV type 1 is described in GB-A-2 239 245 
immunoprecipitation. Peptide conjugates may be used in 
amplified assays and IgG/IgM antibody capture ELISA. 
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The peptide of the present invention may be 
incorporated into a vaccine formulation for inducing 
immunity to HCV in man. For this purpose the peptide may 
be presented in association with a pharmaceutical^ 
acceptable carrier. 

For use in a vaccine formulation, the peptide may 
optionally be presented as part of an hepatitis B core 
fusion particle, as described in Clarke et al ( Nature, 
1987, 330 , 381-384), or a polylysine based polymer, as 
described in Tarn ( PNAS , 1988, 85, 5409-5413). 
Alternatively, the peptide may optionally be attached to a* 
particulate structure, such as lipsomes or ISCOMS. 

Pharmaceutically acceptable carriers include liquid 
media suitable for use as vehicles to introduce the 
peptide into a patient. An example of such liquid media 
is saline solution. The peptide may be dissolved or 
suspended as a solid in the carrier. 

The vaccine formulation may also contain an adjuvant 
for stimulating the immune response and thereby enhancing 
the effect of the vaccine. Examples of adjuvants include 
aluminium hydroxide and aluminium phosphate. 

The vaccine formulation may contain a final 
concentration of peptide in the range from 0.01 to 
5 mg/ml, preferably from 0.03 to 2 mg/ml. The vaccine 
formulation may be incorporaterd into a sterile container, 
which is then sealed and stored at a low temperature, for 
example 4°C, or may be freeze-dried. 
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In order to induce immunity in man to HCV, one or 
more doses of the vaccine formulation may be 
administered. Each dose may be 0.1 to 2 ml, preferably 
0.2 to 1 ml. A method for inducing immunity to HCV in 
man, comprises the administration of an effective amount 
of a vaccine formulation, as hereinbefore defined. 

The present invention also provides the use of a 
peptide as herein defined in the preparation of a vaccine 
for use in the induction of immunity to HCV in man. 

Vaccines of the present invention may be administered 
by any convenient method for the administration of 
vaccines including oral and parenteral (e.g. intravenous, 
subcutaneous or intramuscular) injection. The treatment 
may consist of a single dose of vaccine or a plurality of 
doses over a period of time. 
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SEQUENCE LISTING (FIGURE 1 and la) 



SEQ ID NO 



SEQUENCE TYPE : Nucleotide cDNA sequence 



SEQUENCE LENGTH : 194 base pairs 



STRANDEDNESS : single 



TOPOLOGY 



linear 



MOLECULAR TYPE : genomic DNA from HCV types 1-3 



ORIGINAL SOURCE 



ORGANISM 



human blood samples 



USE 



DEPOSIT 



FEATURES 



Bases - 255 to -62 of the 5' non-coding region showing 
variations in cDNA sequence between HCV types 1-3. 
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SEQUENCE LISTING f FIGURE 3) 



SEQ ID NO 



SEQUENCE TYPE : deduced peptide sequence 



SEQUENCE LENGTH : 85 amino acids 



STRANDEDNESS 



single 



TOPOLOGY 



linear 



MOLECULAR TYPE : HCV peptide 



ORIGINAL SOURCE 



ORGANISM 



human blood samples 



USE 



DEPOSIT 



FEATURES 



Amino acids 2648 to 2732 of NS-5 region showing variations 
in peptide sequence between HCV types 1-3. 
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SEQUENCE LISTING (FIGURE 5) 



SEQ ID NO 



SEQUENCE TYPE : deduced peptide sequence 



SEQUENCE LENGTH : 57 amino acids 



STRANDEDNESS 



single 



TOPOLOGY 



linear 



MOLECULAR TYPE : HCV peptide 



ORIGINAL SOURCE 



ORGANISM 



: human blood samples 



USE 



DEPOSIT 



FEATURES 



Amino acids 1577 to 163 3 of NS-3 region showing variations 
in peptide sequence between HCV types 1-3. 
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SEQUENCE T.TfiTTNG (FIGURE 7) 



SEQ ID NO 



SEQUENCE TYPE : deduced peptide sequence 



SEQUENCE LENGTH : 124 amino acids 



STRANDEDNESS 



single 



TOPOLOGY 



linear 



MOLECULAR TYPE : HCV peptide 



ORIGINAL SOURCE 



ORGANISM 



: human blood samples 



USE 



DEPOSIT 



FEATURES 



Amino acids 5 to 128 of core region showing variations in 
peptide sequence between HCV types 1-3, 
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SEQUENCE LISTING (FIGURE 9a) 



SEQ ID NO 



SEQUENCE TYPE : Nucleotide sequence 



SEQUENCE LENGTH : 367 base pairs 



STRANDEDNESS : single 



TOPOLOGY 



linear 



MOLECULAR TYPE : HCV genomic DNA 



ORIGINAL SOURCE 



ORGANISM 



human blood samples 



USE 



DEPOSIT 



FEATURES 



Bases 4911 to 5277 of NS-4 region of HCV-3 showing 
individual variations and consensus sequence. 
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SEQUENCE LISTING (FIGURE 9b > 



SEQ ID NO 



SEQUENCE TYPE : deduced peptide sequence 



SEQUENCE LENGTH : 128 amino acids 



STRANDEDNESS 



single 



TOPOLOGY 



linear 



MOLECULAR TYPE : HCV genomic DNA 



ORIGINAL SOURCE 



ORGANISM 



human blood samples 



USE 



DEPOSIT 



FEATURES 



Amino acids 1638 to 1765 of NS-4 region of HCV-3 showing 
individual variations and consensus sequence. 
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SEQUENCE LISTING (FIGURE 10a and 10b) 



SEQ ID NO 



SEQUENCE TYPE : peptide sequence 



SEQUENCE LENGTH : 90 amino acids 



STRANDEDNESS 



single 



TOPOLOGY 



linear 



MOLECULAR TYPE : HCV peptide 



ORIGINAL SOURCE 



ORGANISM 



human blood samples 



USE 



DEPOSIT 



FEATURES 



Amino acids 1679 to 17 68 of NS-4 region of HCV1-3 showing 
individual variations and consensus sequence. 
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SEQUENCE LISTING (FIGURE 11 a and 11c) 



SEQ ID NO : 8 



SEQUENCE TYPE : Nonameric peptide sequences 



SEQUENCE LENGTH : 9 amino acids each 



STRANDEDNESS : single 



TOPOLOGY 



: linear 



MOLECULAR TYPE : nonameric HCV peptides 



ORIGINAL SOURCE 



ORGANISM 



synthetic 



USE 



: epitope mapping 



DEPOSIT 



FEATURES 



Nonapeptides corresponding to NS4 region of HCV1-3 used 
for epitope mapping- 
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SEQUENCE LISTING (FIGURE 13) 



SEQ ID NO 



SEQUENCE TYPE : Nucleotide cDNA sequences 



SEQUENCE LENGTH : 30, 70 and 32 base pairs 



STRANDEDNESS : single 



TOPOLOGY 



linear 



MOLECULAR TYPE : genomic DNA from HCV types 1 to 4 



ORIGINAL SOURCE 



ORGANISM 



: human blood samples 



USE 



DEPOSIT 



FEATURES 



Bases -245 to -216; -185 to -116; and -101 to -70 in the 
5 'NCR region of HCV types 1-4. 



WO93/10239 PCT/GB92/02143 

- 88 - 



SEQUENCE LISTING ( FIGURE 1 5A and 15B) 



SEQ ID NO : 10 

SEQUENCE TYPE : Nucleotide with derived protein sequence 

SEQUENCE LENGTH : 240 base pairs 

STRANDEDNESS : single 

TOPOLOGY : linear 

MOLECULAR TYPE : genomic DNA 



ORIGINAL SOURCE 



ORGANISM r human blood samples 



USE 



DEPOSIT : 
FEATURES 

Base pairs 23 - 262 and amino acids 5 to 89 of the core 
region of HCV types 1-4. 
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CLAIMS 

1. A polynucleotide sequence unique to hepatitis C virus 
type 3 or 4 (HCV-3 or HCV-4). 

2. A polynucleotide sequence according to claim 1 which 
is a cDNA sequence. 

3. An HCV-3 or HCV-4 specific polynucleotide sequence 
from the core, NS3 , NS4 or NS5 regions. 

4. An HCV-3 or HCV-4 specific polynucleotide sequence 
comprised within the sequences shown in Figures 1 and la, 
9a, 13 and 15a. 

5. An HCV-3 or HCV-4 polynucleotide sequence which codes 
for an antigenic peptide. 

6. An HCV-3 or HCV-4 polynucleotide sequence which codes 
for C-100, 5-1-1, C33 or C22 peptides having sequence 
alterations unique to HCV-3 or HCV-4. 

7. An HCV-3 polynucleotide sequence encoding the 
polypeptide 

KPALVPDKEVLYQQYDEM or the polypeptide 
ECSQAAPYIEQAQVIAHQF 
or polypeptides of substantially equivalent antigenicity. 
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8. A hepatitus C virus type 2 (HCV-2) polynucleotide 
sequence encoding the polypeptide 

R (A/V) V ( V/I ) (A/T) PDKE ( l/V) lyeafdem or the polypeptide 
ECAS (K/R) AALIEEGQR (M/I) AEML 
or polypeptides of substantially equivalent antigenicity. 

9. An antigenic HCV-3 or HCV-4 specific peptide. 

10. An antigenic HCV-3 or HCV-4 specific peptide from the 
core, NS3, or NS5 regions. 

11. An antigenic HCV-3 specific peptide comprised within 
the sequences shown in Figures 3, 5 or 7. 

12. An antigenic HCV-3 or HCV-4 specific peptide which 
corresponds to c-100, 5-1-1, C33 or C22 proteins having 
sequence alterations unique to HCV-3 or HCV-4. 

13. An antigenic NS4 peptide specific to HCV-3 or HCV-4. 

14. An antigenic HCV-2 specific peptide which is 
R(A/V)V(V/I) (A/T) PDKE (l/V) LYEAFDEM or 

ECAS (K/R) AALIEEGQR (M/I ) AEML 

15. An antigenic HCV-3 specific peptide which is 
KPALVPDKEVLYQQYDEM or 
ECSQAAPYIEQAQVIAHQF 
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16. An antigenic NS4 peptide specific to HCV-4 which 
comprises the HCV-4 sequence substantially from position 
1691 to 1708 or the sequence substantially from position 
1710 to 1728. 

17. A fusion peptide which comprises at least two of the 
antigens of claims 9 to 16. 

18. A fusion peptide according to claim 17 which 
comprises at least one of the antigens of claims 9 to 16, 
fused to B-galactosidase, GST, trp E, or polyhedrin coding 
sequence. 

19. A peptide according to any of claims 9 to 18 which is 
labelled. 

20. A vaccine formulation which comprises an antigen 
according to any of claims 9-18. 

21. Antibody to the antigenic peptide of any of claims 9 
to 19. 

22. An immunoassay device which comprises a solid 
substrate having attached thereto antigen according to any 
of claims 9 to 19. 

23. A device according to claim 22 for HCV screening 
which comprises a mixture of antigens. 
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24. A device according to claim 23 wherein the antigens 
are from the core and NS4 regions. 

25. A device according to claim 22 for HCV-typing which 
comprises a plate having a series of locations 
respectively containing HCV-1, HCV-2, and HCV-3 specific 
antigens. 

26. A device according to claim 25 which further 
comprises a location containing HCV-4 specific antigen. 

27. An immunoassay device which comprises a solid 
substrate having attached thereto antibody according to 
claim 21. 

28. A method of in vitro testing for HCV which comprises 
reverse transcribing any HCV polynucleotide present and 
amplifying by polymerase chain reaction (PCR) , and 
detecting the amplified HCV polynucleotide employing an 
HCV-2, HCV-3 or HCV-4 specific polynucleotide probe. 

29- A method of in vitro testing for HCV according to 
claim 28 wherein the primers employed for PCR are selected 
from those set out in Table 1. 
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30* A method of in vitro HCV typing which comprises 

- carrying out endonuclease digestion of an HCV- 
containing sample employing ScrFI or Haelll/Rsal 
endonuclease; and 

- comparing the restriction patterns with 
characteristic type-specific patterns. 

31. A method according to claim 30 or 31 which further 
employs Hinfl in a separate or combined digestion. 

32. A method of in vitro HCV typing which comprises 

- carrying out endonuclease digestion of an 
HCV-containing sample employing ScrFI endonuclease, 
the restriction pattern being characteristic of 
HCV-1, HCV-2 and HCV-3; 

- carrying out endonuclease digestion employing Hinfl 
endonuclease, the restriction pattern being 
characteristic of HCV-4. 

33. A method of in vitro screening a sample for HCV 
antibodies which comprises; 

- carrying out an immunoassay employing an antigenic 
peptide according to any of claims 1 to 19 or mixture 
thereof; and 

- detecting any antibody - antigen complex produced. 
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34. A method of in vitro EOT antibody typing which 
comprises separately contacting a potentially 
antibody-containing sample or samples with a series of 
HCV-type-specific antigens; and detecting any 
antibody-antigen complex produced with each type-specific 
antigen. 
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FIGURE 2 
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FIGURE 4 
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FIGURE 6 
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FIGURE 8 
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FIGURE 9a 

NUCLEOTIDE SEQUENCES 

4911 4970 

T0040 GACACACCCU GUCACAAAAU ACAUCAUGGC AUGCAUGUCA GCUGAUCUGG AAGUAACCAC 
T0038 ??CACACCCC GUCACAAAAU ACAUCAUGGC AUGCAUGUCA GCUGAUCUGG AAGOAACCAC 
T0036 ??CACACCCC AUCACAAAAU ACAUCAUGGC AUGCAUGUCA GCUGAUCUGG AAGUAACCAC 
T0026 ????CACCCC AUCGCGAAAU ACCUCAUGGC AUGOAUGUCA GCUGAUCUGG AAGUAACCAC 
T1787 ????CACCCC AUCACAAAAU ACCUCAUGGC AUGCAUGUCA GCUGAUCUGG AAGUAACCAC 

Cons GACACACCCU GUCACAAAAU ACAUCAUGGC AUGCAUGUCA GCUGAUCUGG AAGUAACCAC 

4971 5030 

T0040 CAGCACCUGG GUGUUGCUUG GAGGGGUCCU CGCGGCCCUA GCGGCCUACU GCUUGUCAGU 
T0038 CAGCACCUGG GU GUUG C UUG GAGGG GUCC U CGCGGCCCUA GCGGCCUACU G CUUGUC AGU 
T0036 CAGCACCUGG GUGUUGCUUG GAGGGGUCCU CGCGGCCCUA GCGGCCUACU GCUUGUCAGU 
TO 02 6 CAGCACCUGG GUGUUGCUUG GAGGAGUCCU CGCUGCCCUA GCGGCCUACU G CUUGUCA GU 
T1787 CAGCACCUGG GUGUUGCUUG GAGGGGUCCU CGCGGCCCUA GCGGCCUACU GCUUGUCAGU 

Cons CAGCACCUGG GUGUUGCUUG GAGGGGUCCU CGCGGCCCUA GCGGCCUACU GCUUGUCAGU 

5031 5090 

TO 040 CGGCUGCGUU GUGAUUGUGG GUCAUAUUGA GCUGGGGGGC AAGCCGGCA AUCGUUCCAGA 
T0038 CGGCUGCGUU GUGAUUGUGG GCCAUAUUGA GCUGGGGGGC AAGCCCGCA CUCGUUCCAGA 
T0036 CGGCUGCGUU GUGAUUGUGG GCCAUAUUGA GCUGGGGGGC AAGCCGGCA CUC GUUC CAGA 
T0026 CGGCUGCGUU GUGAUUGUGG GUCAUAUUGA GCUGGGGGGC AAGCCAGCA CUCGUUCCAGA 
T1787 CGGCUGCGUU GUGAUUGUGG GUCAUAUUGA GCUGGGAGGC AAGCCGGCA CUCGUUCCAGA 

Cons CGGCUGCGUU GUGAUUGUGG GUCAUAUUGA GCUGGGGGGC AAGCCGGCA AUCGUUCCAGA 

5091 5150 

TOO 40 CAAAGAGGUG UUGUAUCAAC AAUACGAUGA GAUGGAGGAG UGCUCGCAA GCUGCCCCAUA 
TO 03 8 CAAAGAAGUG UUGUAUCAAC AAUACGAUGA GAUGGAGGAG UGCUCGCAA GCCGCCCCAUA 
TO 03 6 CAAAGAGGUG UUGUAUCAAC AAUACGAUGA GAUGGAGGAG UGCUCGCAA GCCGCCCCAUA 
T0026 CAAAGAGGUG UUGUAUCAAC AAUACGAUGA GAUGGAGGAG UGCUCGCAA GCCGCCCCAUA 
T1787 CAAGGAGGUG UUGUAUCAAC AAUACGAUGA GAUGGAGGAG UGCUCGCAA GCCGCCCCAUA 

Cons CAAAGAGGUG UUGUAUCAAC AAUACGAUGA GAUGGAGGAG UGCUCGCAA GCUGCCCCAUA 

5151 5210 

T0040 UAUCGAACAA GCUCAGGUGA UAGCCCACCA GUUCAAGGAG AAAGUCCUU GGADUGCUGCA 
TO 03 8 UAUCGAACAA GCUCAAGUAA UAGCCCACCA GUUCAAGGAG AAAGUCCUU GGAUUGCUGCA 
T0036 UAUCGAACAA GCUCAGGUAA UAGCCCACCA GUUCAAGGAG AAAGUCCUU GGAUUGCUGCA 
T0026 UAUCGAACAA GCUCAGGUAA UAGCCCACCA GUUCAAGGAG AAAGUCCUU GGAUUGCUGCA 
T1787 UAUCGAACAA GCUCAGGUAA UAGCCCACCA GUUCAAGGAG AAAGUCCUU GGGUUGCUGCA 

Cons UAUCGAACAA GCUCAGGUGA UAGCCCACCA GUUCAAGGAG AAAGUCCUU GGAUUGCUGCA 

5211 5270 

TOO 40 GCGAGCCACC CAACAACAAG CUGUUAUUGA GCCAAUAGUA GCUACCAAC UGGCAAAAGCU 
TO 03 8 GCGGGCCACC CAACAACAAG CUGUUAUUGA GCCCAUAGUA GCUACCAAC UGGCAAAAGCU 
T0036 GCGAGCCACC CAACAACAAG CUGUUAUUGA GCCAAUAGUA GCUACCAAC UGGCAAAA??? 
T0026 GCGAGCCACC CAACAACAAG CUGUCAUUGA GCCCAUAGUA GCUACCAAC UGGCAAAA??? 
T1787 GCGAGCCACC CAACAACAGG CUGUCAUUGA GCCCAUAGUA GCUACCAAC UGGCAAAAGCU 

Cons GCGAGCCACC CAACAACAAG CUGUUAUUGA GCCCAUAGUA GCUACCAAC UGGCAAAAGCU 

5271 

T0040 UGAGACC 
TO 03 8 UGAGGCC 
T0036 ??????? 
T0026 ??????? 
T1787 UGAGGCU 



Cons UGAGACC 
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FIGURE 9b 



D E DUCED AHXHD AC23) SEQUENCES 



1638 
v 

T0040 THPVT 
T0038 .HPVT 
T0036 .HPIT 
T0026 -HPI& 
T1787 .HPIT 



KYIMA CMSAD 
KYIMA CMSAD 
KYIMA CMSAD 
KYLMA CMSAD 
KYVMA CMSAD 



LEVTT STWVL 
LEVTT STWVL 
LEVTT STWVL 
LEVTT STWVL 
LEVTT STWVL 



LGGVL AALAA 
LG6VL AALAA 
LGGVL AALAA 
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T0040 RATQQ QAVTE PIVAT NWQKL ETFWH KHM 
T0038 RATQQ QAVTE PIVAT NWQKL EAFWH KHM 

T0036 RATQQ QAVTE PIVAT NWQ 

T0026 RATQQ QAVTE PIVAT NWQ 

T1787 RATQQ QAVTE PIVAT NWQKL EAFWH KHM 



Cons RATQQ QAVTE PIVAT NWQKL EAFWH KHM 
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